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Preface 


This book is exactly what its title claims to be. It is an invitation to 
students and researchers in different areas of physics to share the fun and 
excitement of astrophysics. 

During the last two decades, technological advances have allowed us 
to observe the cosmos with unprecedented accuracy, making astrophysics 
and cosmology branches of science driven forward by observations. This 
excitement and growth has attracted more people to enter this area and, 
given the current enthusiasm, it is inevitable that astrophysics will continue 
to be a leading branch of science in the early part of this millennium. 

This subject has also fascinated the layman all through the ages, re- 
sulting in a proliferation of popular articles and books. Every branch of 
astrophysics, of course, has excellent text books authored by the experts. 
There are also a few comprehensive text books at an advanced level in- 
tended for serious researchers in the field. In spite of all these, currently 
available literature leaves a gap which the present book is designed to fill. 
This book will be useful, for example, to a theoretical physicist who wants 
to have a quick overview of the entire subject of astrophysics — and appre- 
ciate the excitement — but without getting lost in the details. In a way, it 
is expected to serve as ‘a guide to astrophysics for an intelligent outsider’; 
maybe a researcher in theoretical physics, a student of physics or even an 
observational astronomer who wants to dabble a bit with the theory. 

I have designed the book keeping the above perspective in mind. To 
begin with, it is by no means expected to be comprehensive. For that, 
there are other books (including the three volume course of astrophysics, 
running to about 1800 pages, authored by me and published by Cambridge 
University Press). The choice of topics here stresses both the foundations 
and the frontiers, moving lightly over anything in between. 
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Second, I have made sure that the book is self contained as far as astron- 
omy and astrophysics are concerned. For example, astrophysics, like every 
other subject, has a collection of jargon specific to it and I do not expect 
the reader to have any familiarity with this jargon. Astronomical jargon is 
explained in the language of the physicist so that the book is accessible to 
a wider community. The same criterion is used in the choice of theoretical 
foundation that is laid in the first three chapters. These three chapters deal 
with those areas of theoretical physics which are widely used in astrophysics 
but might require some amount of brushing up for an average student of 
theoretical physics. For example, bremsstrahlung or ionization equilibrium 
or the propagation of a shock wave may not be a part of every researcher’s 
arsenal even though they might have learnt basic electrodynamics or fluid 
mechanics. The first three chapters gather together several such topics in 
order to make this book self contained. At the same time, I have not tried 
to teach the basic areas of theoretical physics; I expect the reader to know 
the basics of classical and quantum mechanics, special relativity, electro- 
dynamics, etc. (But not general relativity, which is introduced briefly in 
Chapter 1.) 

Third, I have added a fair number of exercises at the end of each chapter 
which will provide some amount of practice for the physicist in applying 
the concepts to astrophysical contexts. In particular, there are quite a few 
problems titled ‘Real Life Astronomy’ which take the reader directly to 
published literature. 

Fourth, I have kept the algebra to the minimum commensurate with 
clarity. Whenever the derivation of an equation is straightforward (even 
though lengthy), I have merely indicated how the result can be obtained 
rather than hold the reader by hand and take him/her through the algebra. 
I think this allows for a crisper pace and those who are interested in the 
key concepts and results can always take algebra for granted. 

Finally, I have tried to keep the discussion in the later chapters fairly 
topical. The emphasis is on research areas which are currently popular 
rather than on more traditional topics. This has the disadvantage that 
some of the ideas might change a bit over the next decade or so but the 
advantage of conveying the current excitement in the frontier areas more 
than compensates for this. 

The cognoscenti will find that many of the derivations of even stan- 
dard results (Kepler problem, Compton scattering, Kompaneets equation, 
Schwarzschild metric, gravitational lensing deflection ...) are different from 
the ones usually found in textbooks. Hopefully, even the experts find some- 
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thing new and enjoyable in this book. I have also not shied away from 
discussing some of the topics usually considered to be rather advanced but 
the physical content of which is quite simple (for example, the concept of 
black hole temperature). 

The idea for this book originated from several colleagues who suggested 
to me that an introductory text, to supplement my three volume course 
in astrophysics, will be a useful addition to the literature. I was initially 
hesitant since such a book will necessarily overlap to a certain extent with 
the existing textbooks in the market, including mine. However, many peo- 
ple convinced me that there is really a need for such a book and some 
amount of overlap will do no harm. I am thankful to J.V. Narlikar, who is 
the editor of this series published by World Scientific, for an enthusiastic 
and positive response. I thank L. Sriramkumar for reading the manuscript 
and giving extensive comments. I have also benefited from my discussions 
with K. Subramanian, R. Nityananda, D. Lynden—Bell and the members 
of the Astronomy Division of Caltech regarding several aspects of peda- 
gogy related to astrophysics. In particular, K. Subramanian tried out parts 
of the material in the book in an introductory astrophysics course he was 
teaching at Pune University. I am also grateful to R. Gupta, H.M. An- 
tia, C.C. Steidel and Jim Kneller for providing the data which were used 
in generating some of the figures. As I have already mentioned, it is im- 
possible to avoid certain amount of overlap in the discussion with other 
existing text books including my own Theoretical Astrophysics Vol I-III 
published by Cambridge University Press. I have referred to this and other 
text books wherever appropriate in the Notes and References at the end of 
each chapter. 

It was a pleasure working with Ms Magdalene Ng and her World Scien- 
tific team in this project and I thank them for excellent processing of the 
book. 

I thank Vasanthi Padmanabhan for dedicated support and for taking 
care of the entire latexing, production of figures and for formatting the 
text. It is a pleasure to acknowledge IUCAA for the research facilities and 
ambience it has provided. 


T. Padmanabhan 
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Prologue 


Astrophysics involves the application of laws of physics to understand the 
cosmic structures which, broadly speaking, include anything bigger than 
the planets. The purpose of this prologue is to give you a bird’s eye view of 
these structures and introduce some essential jargon. The scales of physical 
variables, like mass, length, time, temperature, etc. which we encounter in 
astrophysics are summarised in the Appendix at the end of the book. 

The description of astrophysical structures could begin either from small 
scales and proceed to the large scales or the other way around. We will pro- 
ceed from the largest scale downwards. You would certainly know — even 
if you are a rank outsider to astronomy — that our universe is expanding. 
The current rate of expansion (called Hubble constant) is measured to be 
about Hp = 0.3 x 1071h s7! where h & 0.72 is a dimensionless factor 
of order unity that parameterizes the actual rate of expansion. This gives 
you a characteristic time scale, tuniy = Hp’ ~ 10/°h~1 years, that can be 
associated with our universe. This is typically the largest time scale over 
which cosmic evolution takes place. Multiplying it by the speed of light, 
one can construct a length scale ctyniy associated with the universe which is 
roughly the size of the visible universe today. Numerically, ctuniy © 1078A7! 
cm and cosmologists usually write this as ctuniy ~ 3000h~1 Mpc where 
1 Mpc = 3 x 104 cm is a convenient unit for extragalactic astronomy. (We 
will also use 1 kpc =% 3 x 107! cm and 1 pc æ% 3 x 108 cm in describing 
galactic and sub-galactic scales.) 

The rate of expansion Hg has the dimension of frequency. You know 
that, if p is a mass density, /Gp also has the dimensions of frequency. 
Therefore, by introducing the Newtonian gravitational constant G, you can 
define a characteristic density, pc, from the rate of expansion Hp. It is 
conventional (and useful) to define it with an extra numerical factor, as 
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pe = 3H? / (8TG) = 1.88h? x 10-79 gm cm~%. Incredibly enough, the actual 
density of our universe is very close to this value. 

An expanding universe would have been smaller, denser and hotter in 
the past and — at sufficiently early moments — matter would have existed 
as a primordial soup of elementary particles. Observations show that our 
universe is embedded in a thermal radiation — called Cosmic Microwave 
Background Radiation (CMBR) — at a temperature of about 2.73 K (which 
corresponds to an energy scale of kgT ~ 2 x 1074 eV using kp = 0.86 x 
10-4 eV K~!). In the past, the temperature would have been higher, 
increasing essentially as the inverse power of the size of the universe. When 
kgT is higher than the energy scale Ey; = (hc?/G)!/? = 10!° GeV — 
which can be obtained from the three fundamental constants — quantum 
gravitational effects would have been important in the universe. We have 
no clue about this phase but, fortunately, this ignorance doesn’t seem to 
affect our ability to predict what happens in the universe as it cools to more 
moderate energies. 

Two events in the later life of the universe are of deep significance (see 
Chapter 6). First, when the universe was about a minute old, it was cool 
enough to allow the protons and neutrons to come together and synthe- 
size some amount of deuterium and helium (but not significant amounts 
of heavier elements). From the observed abundance of these primordial 
elements, one can put a bound on the amount of baryonic matter (that is, 
protons and neutrons) that can exist in the universe. This turns out to 
be a fraction Qg % 0.05 of the critical density pe introduced earlier. Since 
other observations show that the mean density of the universe is actually 
about pc, most of the matter in the universe must be made of non baryonic 
matter. In fact, observations suggest that nearly 30 per cent of the matter 
is made of Weakly Interacting Massive Particles (WIMPs), usually called 
dark matter, while the remaining constituent is in a more exotic form of a 
fluid that exerts negative pressure (called dark energy). Understanding the 
nature of this dark energy is the greatest theoretical challenge in cosmology 
today. 

Second, when the size of the universe was about 10° times smaller than 
the current size, it was cool enough for the electrons to combine with pro- 
tons and helium nuclei to form a neutral gaseous mixture of hydrogen and 
helium. The formation of the neutral atoms allowed the radiation to decou- 
ple from matter and flow freely without significant scattering or absorption 
till today. It is this relic radiation that we observe as the CMBR mentioned 
earlier. Clearly, CMBR will contain a fossilized record of the state of the 
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universe when it was 10° times smaller. Direct observations of CMBR now 
tells us that — at that time — the matter in the universe was distributed 
very smoothly but with tiny fluctuations in energy density amounting to 
about 1 part in 10°. The detection of these tiny fluctuations, in the last 
decade or so, has been a high point in our understanding of the cosmos. 

As the universe evolved further, these tiny fluctuations in the density 
grew by gravitational instability and eventually formed the different struc- 
tures which we see in the universe. The dark matter particles cluster gravi- 
tationally to form large, virialized, self-gravitating lumps in which baryonic 
matter gets embedded in the form of, say, galaxies. In fact, direct obser- 
vations of the gravitational potential surrounding the visible part of the 
galaxies show that galaxies are embedded in large dark matter halos which 
are probably at least ten times as massive. The normal matter located 
within these dark matter halos radiate energy, cool and sink to the cen- 
tral regions of the halos finally reaching a size of about 15 kpc in radius 
(see Chapters 7 and 8). Incidentally, one can obtain this mass scale of the 
galaxy entirely in terms of fundamental constants: 


ee. 
M, = a,,’a° (z) Mp ~ 3 x 10*4gm 


e 


where ag = (Gm3/ħc) ~ 1078 and a = (e?/ħc) ~ 107? are dimensionless 
constants. Clearly, the scale of galaxies is no accident but is determined by 
physics. 

The formation of galaxy like structures took place from the time when 
the universe was about a tenth of the present size. In fact, even larger struc- 
tures — which are gravitationally bound — exist in the universe. Groups 
of galaxies may typically consist of about (5-100) galaxies; a system with 
more than 100 galaxies is conventionally called a cluster. The sizes of groups 
range from a few hundred kiloparsecs to few Megaparsecs. Clusters have a 
size of typically a few Megaparsecs. The gravitational potential energy is 
comparable to the kinetic energy of random motion in these systems. The 
line of sight velocity dispersion in the groups is typically 300 km s~! while 
that in the clusters can be nearly 1000 km s~!. About ten percent of all 
galaxies are members of large clusters. In addition to galaxies, clusters also 
contain very hot intra-cluster gas at temperatures (107 — 108)K. 

The lumps of gas within the galaxies will further undergo gravitational 
contraction and when the temperature at the central region of such a con- 
tracting gas cloud is high enough to ignite nuclear reaction, it will reach 
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a stable configuration in the form of a star (see Chapter 4). The typical 
mass of a star is 1 Mọ % 2 x 1033 gm (which is the mass of the Sun) and 
the radius is about Ro ~ 7 x 101° cm. Again, this scale is determined by 
fundamental constants: 


m.\3/4 / @ \3/2 
M, = (z2) (=) Mp ~ 10°? gm. 


Me QG 


For comparison, the mass and radius of the Earth are Mg = 6 x 102” gm 
and Rẹ = 6 x 10° cm, respectively. (Astronomers use the subscript © for 
matters pertaining to Sun and @ for matter pertaining to Earth.) From 
the previous estimate, it is clear that a typical galaxy contains about 10!! 
to 10!? stars. The mean distance between galaxies in the universe is about 
1 Mpc while the mean distance between the stars in the galaxy is 1 pc. 

The first generation of stars (which are called Population II or Pop II 
stars) will essentially be made from a cloud of hydrogen and helium. Nu- 
cleosynthesis in the hot stellar furnace leads to the formation of heavier 
elements inside the star. When the nuclear fuel is exhausted, the stars die 
and become stellar remnants which could take different forms (see Chapter 
5). For example, high mass stars end their life in spectacular explosions 
(called supernova) which are the brightest optical events in the sky. The 
supernova explosion spews out the heavier elements synthesized inside the 
stars on to the interstellar medium of the galaxy from which the next 
generation of stars (usually called Pop I) form. What is left behind in a 
supernova explosion will be a rotating neutron star usually called a pulsar. 
Less massive stars may end their life leaving behind a remnant called white 
dwarf. Both white dwarfs and neutron stars are quantum mechanical ob- 
jects in the sense that the gravitational attraction is balanced by quantum 
degeneracy pressure of electrons or neutrons in these objects. It is one of 
the many examples in which microphysics dictates the form of large scale 
structures in the universe. And the really high mass stars leave behind a 
black hole as the remnant. 

Since stars form in collapsing gas clouds, they very often form in pairs 
or larger groups. In fact, some of the oldest structures in the universe are 
structures known as globular clusters which are conglomerations of about 
10° stars. Our galaxy, for example, contains several of them distributed 
in a roughly spherical manner. When the stars form in pairs (binaries, as 
they are called) it is quite possible for one of them to evolve faster and 
leave behind a remnant (white dwarf, neutron star or a black hole) which 
will acrete matter from the second star leading to high energy radiation. 
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Several such systems, often consisting of a star with an invisible companion, 
have been observed. 

Accretion of matter, allowing gravitational potential energy to be con- 
verted into radiation, is an extraordinarily efficient process and works in 
another class of systems, called active galactic nuclei (see Chapter 9). These 
are galaxies, which harbor massive (> 107 Mo) black holes in their centers. 
Matter accreting to them produces luminosity exceeding that of an entire 
galaxy from a region smaller in size than the solar system! Many such sys- 
tems are known and they have existed even when the universe was about 
five time smaller in size. The farther we can see, the earlier we can probe, 
because of the finite time taken by the light to traverse the distance; AGN 
can be seen from very large distances and hence can be used to probe the 
universe when it was considerably younger. 

The formation of galaxies by gravitational instability cannot, of course, 
be one hundred per cent efficient and one would expect fair amount of gas 
to be left in between the galaxies in the form of an intergalactic medium. 
Observations suggest that the intergalactic medium is ionized to a high 
degree, possibly from the radiation emitted by the first structures, which 
formed in the universe when it was smaller. 

The above summary illustrates the diversity of structures we encounter 
in astrophysics. The density varies from 10~28 g cm~® in the intergalactic 
medium to 104 g cm~? in a neutron star; the sizes vary from a 10° cm for 
the radius of a planet to 1075 cm for a large galaxy cluster; the distance 
scales range from a few parsecs in the interstellar medium to a few thousand 
Megaparsecs in the size of the universe; the masses range from 10°? gm 
for a typical star to about 104” gm for a cluster of galaxies. Consequent 
of this variety is the fact that matter exists in widely different forms in 
the astrophysical systems: ionized plasma, neutral gas, solid of moderate 
density and material with nuclear densities. Clearly, one requires a wide 
variety of input physics to understand these phenomena. 

In this book we will attempt to cover all these phenomena (and more) 
from the point of view of a curious physicist. The first three chapters 
provide the theoretical background describing a key processes in gravity, 
theory of radiation, fluid mechanics etc. which are extensively used in 
astrophysics. Chapters 4 and 5 take up the physics of stars and stellar 
remnants which is the corner stone of astrophysics; star formation, stellar 
evolution, white dwarfs, neutron stars, supernova and supernova remnants, 
binary stars ..... all of them find their place here. In Chapters 6 and 7 we 
move to the other end of the scale. Chapter 6 discusses the current ideas in 
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cosmology and Chapter 7 builds this further to describe the formation of 
galaxies. The actual description of galaxies as physical systems is presented 
in Chapter 8. The last chapter covers the more exotic type of galaxies, viz., 
the active galactic nuclei. 
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Chapter 1 


Gravitation 


Given the fact that gravity plays the key role in astrophysics — at all scales 
from planets to the universe — it makes sense to begin our discussion with 
a study of gravitational physics. Though the correct description of gravity 
requires general relativity, you can get by, most of the time, with the simpler 
Newtonian approximation based on three assumptions: 

(i) Gravity obeys the principle of superposition: If particle 1 and particle 
2 individually produce gravitational fields g, and go, then the gravitational 
field produced in the presence of both the particles is just the sum gı + 
g2. So if we know the gravitational field dg at a location x due to an 
infinitesimal amount of matter dM = p(t,x’) dx’ located around x’, we can 
find the gravitational field produced by any mass distribution by integrating 
over the volume. 

(ii) So all we need to know is the gravitational field at x, produced by an 
infinitesimal amount of matter dM at x’. Following the footsteps of New- 
ton, we will take this to be dg = —G(dM)(#/r?) = —Go(t,x’) da’ (#/r?) 
where r = x — x’, which is just the inverse-square law for the gravitational 
force. The total field due to any mass distribution can then be found by 
the integral: 


x— x’ 
This force can be expressed as a gradient of a potential: g = —Vx@ where 


the gravitational potential due to the density distribution p(t,x) is taken 
to be 


Gplt,x’) 

=- | Ër ———. 1.2 
gi) =- fet POX (1.2) 
The equation g = —Vx¢ by itself does not fix the normalization for ¢ 
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and you can add any constant to it without changing g. But the explicit 
definition in Eq. (1.2) fixes it: with this definition, ¢ — 0 as |x| — oo if 
the density distribution falls faster than |x|~? at large distances. Using 
the identity V?(1/|x|) = —4rôp(x), where dp is the Dirac delta function 
(which is most easily proved by integrating both sides over a spherical 
region of radius r) we see that Eq. (1.2) also implies the Poisson’s equation: 
V2 = 4nGp. 

(iii) To complete the story, we need to know how particles move in a 
given gravitational field. We will assume that a gravitational field g exerts 
a force f = mg on a particle of mass m. 

These simple and familiar rules of Newtonian gravity hide several non- 
trivial features. First, note that particles produce (and respond to) grav- 
itational field with a strength proportional to their masses. There is no 
a priori reason why they should! For example, particles produce (and re- 
spond to) electric field with a strength proportional to electric charge q 
so that the equation of motion for a charged particle in an electric field 
is ma = qE. Here m is the inertial mass of the particle which occurs in 
Newton’s second law F = ma in which the acceleration a could have been 
produced by any force F. One can easily imagine the existence of a “grav- 
itational charge” qg so that the gravitational force scales in proportion to 
qg with the equation of motion for a particle in a gravitational field being 
ma = qg. But this is not the case. The mass m which occurs in the 
equation for the gravitational force, f = mg, is the same mass m which 
occurs in the Newton’s second law. An immediate consequence is that the 
acceleration experienced by particles with different values of m is the same 
in a given gravitational field. (This fact — which is one version of principle 
of equivalence — is a key ingredient in Einstein’s general relativity which 
describes gravity as curvature of space time; see Sec. 1.5.) 

Second, the gravitational potential @(t,x) (and the force) at time t 
depend on the density distribution p(t,x) at the same time t; that is, New- 
tonian gravity acts instantaneously. If the Sun disappears, its effect on 
Earth will vanish at the same instant. Obviously, this will not be true in a 
relativistic theory in which signals cannot propagate at speeds greater than 
that of light. If the Sun disappears, it should take at least about 8 minutes 
for the effect to be felt on Earth. General relativity takes care of this but, 
as we said before, we can ignore this feature in the nonrelativistic limit. 

And, finally, the principle of superposition — something you might have 
thought is obvious — is not true in the correct description of gravity based 
on general relativity. This is because relativity demands that all energies 
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contribute to gravity including the energy of the gravitational field itself; 
this makes the theory non linear. So, even if you know the field produced by 
two particles individually, you cannot find the field when both are present 
by just adding them up. You need to go and solve Einstein’s equations — 
which is the analogue of Poisson’s equation —- again. This is one reason 
why we have only a handful of known exact solutions in general relativity. 

What is the necessary condition for the validity of the non relativistic 
description of gravity? The gravitational potential in Eq. (1.2) has the 
dimension of square of velocity and we can use Newtonian gravity as long 
as |¢| < c. In the case of a particle of mass M located at the origin, the 
gravitational potential at a distance r is |¢| = GM/r = v? where v is the 
speed of a particle in a circular orbit at the radius r. The quantity, |¢|/c? = 
GM/(c?r) = (u/c)? is, therefore, a measure of relativistic effects in the 
system. Newtonian gravity can be trusted for v? < c?, that is at distances 
r >> rg = 2GM/c*. The length scale rg is called the Schwarzschild radius 
for a particle of mass M and the factor 2 is included in order to agree with 
exact definition in general relativity (which we will discuss in Sec. 1.5). 
Numerically, rg ~ 3 km (M/Mo) where M = Mọ & 2 x 10° gm is the 
mass of the Sun. Clearly, rg is tiny compared to the size of the Solar 
system, or for that matter, the size of the Sun itself. Hence, Newtonian 
gravity works reasonably well in the Solar system. 


1.1 Orbits in Newtonian gravity 


The simplest problem in Newtonian gravity is that of a one-body problem: 
say, that of a planet moving around the Sun, with the Sun assumed to stay 
fixed at the origin. It turns out that the trajectory of a particle, moving 
under the attractive inverse square law force, is a circle (or part of a circle) 
in the velocity space. The proof is quite straightforward. Start with the 
text book result that, for particles moving under any central force f(r), the 
angular momentum J = r x p is conserved. (Here r is the position vector, 
p is the linear momentum and f is the unit vector in the direction of r. The 
conservation law arises because in dJ/dt, the dr/dt is in the direction of p 
and the dp/dt is in the direction of r for the central force). The constancy 
of J implies, among other things, that the motion is confined to plane 
perpendicular to J. Let us introduce in this plane the polar coordinates 
(r,0) and the Cartesian coordinates (x,y). The conservation law for J 
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implies that r26 is a constant giving 
Å = constant/r? = hr™?. (1.3) 


(Incidentally, this is equivalent to Kepler’s second law, since (r?6/2) = h/2 
is the area swept by the radius vector in unit time.) In a small interval 
of time dt, the magnitude of the velocity changes by Av = (GM/r?)ét 
according to Newton’s law for the inverse square law force. The angle 
changes by A0 = (h/r?)ét from the conservation of angular momentum 
expressed as Eq. (1.3). Dividing the two relations, we get 


Av GM 
AO” h 
But, in the velocity space, Av is the arc length of the curve traced by the 
moving particle and A9 is the angle of turn. If the ratio between the two 
is a constant, then the curve must be (part of) a circle. 
An algebraic derivation of the same result is equally easy: Newton’s 
laws of motion give mv, = f(r) cos @;mvy = f(r) sin @ for any central force. 
Dividing these equations by Eq. (1.3) we get 


2 2 
dus FO oso, ity Or 


= constant. (1.4) 





mgh 70 h sin 0. (1.5) 
The miracle is now in sight for inverse square law force, for which f(r)r? 
is a constant. For the planetary motion we have f(r)r? = -GMm, leading 
to 
dv, GM _ dọ GM, 
We DR ee Fo = z Sine. (1.6) 
Integrating these equations, with the initial conditions v,(@ = 0) = 


0;vy(9 = 0) = u, squaring and adding, we get the equation to the tra- 
jectory in the velocity space, (called hodograph, if you like to know that 
sort of things): 


v? + (vy — u + GM/h? = (GM/h)? (1.7) 


which is a circle with center at (0,u — GM/h) and radius GM/h. So you 
see, planets actually move in circles! 

A little thought shows that the detailed structure depends crucially 
on the ratio between the initial velocity u and GM/h, motivating us to 
introduce a dimensionless quantity e by writing (u — GM/h) = e(GM/h). 
The geometrical meaning of e is clear from Fig. 1.1. If e = 0, ie, if we . 
had chosen the initial conditions such that u = GM/h, then the center 
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Fig. 1.1 Trajectory of a particle moving under r~? force in the velocity space with (a) 
e < 1 (left) and (b) e > 1 (right). 


of the hodograph is at the origin of the velocity space and the magnitude 
of the velocity remains constant. Since h = ur;, where r; is the initial 
position, we get u? = GM/r? leading to a circular orbit in the real space 
as well. When 0 < e < 1, the origin of the velocity space is inside the 
hodograph but is located off-center. As the particle moves the magnitude of 
the velocity changes between a maximum of (1+e)(GM/h) and a minimum 
of (1 — e)(GM/h). When e = 1, the origin of velocity space is at the 
circumference of the hodograph and the magnitude of the velocity vanishes 
at this point. In this case, the particle goes from a finite distance of closest 
approach to infinity, reaching infinity with zero speed. Clearly, e = 1 implies 
u? = 2GM/r; which is just the text book condition for escape velocity. 

When e > 1, the origin of velocity space is outside the hodograph 
and Fig. 1.1(b) shows the behaviour in this case. The maximum veloc- 
ity achieved by the particle is OB when the particle is at the point of 
closest approach in real space. The asymptotic velocities of the particle are 
OA and OC obtained by drawing the tangents from O to the circle. From 
the figure it is clear that sin = e~!. During the unbound motion of the 
particle, the velocity vector traverses the part ABC. 

The structure of the orbits can also be understood from the point of 
view of dimensional analysis. The trajectory of a planet is independent of 
the mass of the planet and hence is determined essentially by the energy 
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(per unit mass) and the angular momentum (per unit mass) of the planet. 
Let the energy per unit mass, with the dimension of square of the velocity, 
be (1/2)u? where u is the initial velocity. Given GM and u?, we can con- 
struct one quantity with the dimension of length: GM/u?. The availability 
of the second constant h — with the dimension of velocity times distance 
— allows us to construct a dimensionless ratio GM/uh = (1+ e)7?. It is 
the existence of this dimensionless ratio that determines the orbital struc- 
ture. If the angular momentum is small in a gravitating system, then its 
size is essentially determined by a single parameter GM/ u? where u is the 
characteristic velocity of the system. This result has an astonishing range 
of validity from satellites orbiting Earth to galaxies in the expanding uni- 
verse, which is (partly) due to the fact that the relevant eccentricities play 
a sub-dominant role in all astrophysical systems. 

So much for life in velocity space. To do anything practical, you need 
the trajectories in real space which — fortunately — turns out to be as 
easy: Consider the tangential velocity in the real space, vr, at any instant. 
This is clearly the component perpendicular to the instantaneous radius 
vector in real space. But in the central force problem, the velocity change 
Av is parallel to the radius vector. So ur is also perpendicular to Av. This 
result, of course, is true in the velocity space as well in which Av is tangent 
to the circle. The vr, which is perpendicular to Av, must be the component 
of velocity parallel to the radius of the circle in the velocity space. From 
Fig. 1.la, it is just the radius XS plus the projection of OS on XS (which 
is OS cos @). So 

vr = Ey OM e oos0 = Ei + ecos8) = * (1.8) 
with the last relation coming from the definition of angular momentum, 
viz. h = rvr. You will recognize that the trajectory r(@) in Eq. (1.8) is a 
conic section with a latus rectum of l = h?/GM and eccentricity of e. 

This derivation, however, leaves one question unanswered. A particle 
moving in 3 space dimensions has a phase space which is 6 dimensional. 
For any time independent central force, we have constancy of energy E and 
angular momentum J. Conservation of these four quantities (E, Jz, Jy, Jz) 
confines the motion to a region of 6 — 4 = 2 dimensions. The projection 
of this phase space trajectory on to xy-plane will, in general, fill a two 
dimensional region of space. So you would expect the orbit to fill a finite two 
dimensional region of this plane, if there are no other conserved quantities. 
But we know that the motion is on a conic section in the xy-plane, which 
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is an one dimensional curve. So, there must exist yet another conserved 
quantity for the inverse square law force which keeps the planet stick to one 
dimension rather than wander off in a two dimensional region. And indeed 
there is, which provides another cute way of solving the Kepler problem. 

To discover the last constant, consider the time derivative of the quan- 
tity (p x J) in any central force f(r)#. We have 








<@xJ)= pxJ= FO) x ( x mi) 
mf (r 
~ TAD ince. #) — ir?) = —mf (rr? = Ae k ag 
That is, 
d 2 dî 
The miracle of inverse square force is again in sight: When f(r)r? = 
constant= —GMm, we find that the vector (called Runge-Lenz vector 
though several others are also connected with its discovery): 
2 
A=pxg~ Sm r (1.11) 


is conserved. But we needed only one constant of motion while we now 
have got 3 components of A which will prevent the particle from moving at 
all! Such an overkill is avoided because A satisfies the following two, easily 
verified, relations: 


A? =ImJ*E+(GMm?)?; A.J =0 (1.12) 


where E = p*/2m —GMm/r is the conserved energy for the motion. The 
first relation tells you that the magnitude of A is fixed in terms of other 
constants of motion and the second one shows that A lies in the orbital 
plane. These two constraints reduce the number of independent constants 
in A from 3 to 1, exactly what we needed. It is this extra constant that 
keeps the planet on a closed orbit. To find that orbit, we only have to take 
the dot product of Eq.(1.11) with the radius vector r and use the identity 
r.(p x J) = J.(r x p) =J?. This gives 





A.r = Ar cosb = J? — GMm?r (1.13) 
or, in more familiar form, the conic section: 

1 GMm? A 

2 = = ; 1.14 

7 J G+ Gime °° 0) ( ) 
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(Compare with Eq. (1.8).) As a bonus we see that A is in the direction 
of the major axis of the ellipse. One can also verify that the offset of the 
center in the hodograph, (GM/h)e, is equal to (A/hm?). Thus A also has 
a geometrical interpretation in the velocity space. It all goes to show how 
special the inverse square law force is! 

If you add a 1/r? or 1/r4 component to the force, (which can arise, 
for example, from the general relativistic corrections to Newton’s law of 
gravitation or because the Sun is not spherical and has a small quadrupole 
moment) J and E are still conserved but not A. If the perturbation is 
small, it will make the direction of A slowly change in space and we will 
get a “precessing” ellipse. As an explicit example, consider the motion in 
a potential V = —(GMm/r) — (8/r?) where the second term is treated as 
a small perturbation. From Eq. (1.10), we find that the rate of change of 
Runge-Lenz vector is now given by A = (2Gm/r)(d/dt)(r/r). The change 
AA per orbit is obtained by integrating Adt over the range (0,7) where 
T is the period of the original orbit. Doing one integration by parts and 
changing the variable of integration from t to the polar angle 6, we get AA 
per orbit to be 


r dr 
Al obit = 26m [2 —— dé. (1.15) 


Let us take the coordinate system such that the unperturbed orbit originally 
had A pointing along the z—axis. After one orbit, a AA, component will 
be generated and the major axis of the ellipse would have precessed by an 
amount A@ = AA,/A. The AA, can be easily obtained from Eq. (1.15) 
by using y = r sin ĝ, converting the dependent variable from r to u = (1/r) 
and substituting (du/d@) = —(A/J*)sin@ [which comes from Eq. (1.14)]. 
This gives the angle of precession per orbit to be 


AAy _ _ 28m se du QnBm 
A Ady da P 








Ap = (1.16) 
In this particular case, the problem can be solved exactly (see Ex. 1.5) but 
the above trick will work for any general perturbation to the original (1/r) 
term. 

The next logical step from planetary motion is the gravitational two- 
body problem which, fortunately, turns out to be as simple as the one- 
body problem we have just solved. If we take two bodies with masses 
mı and mg and positions rı and rz, then the total momentum of the 
system Pitot = Mili + Mere can be written as (mı + m2)R where R = 
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[((miri + mer2)/(m1 + me)} is the position of the center of mass. For 
the closed system, Prot is conserved and the center of mass moves with 
uniform velocity. By going to another inertial frame which moves with 
the center of mass, we can treat R to be a constant and by shifting the 
coordinates, make the center of mass, the origin. If we further define the 
relative distance between the particles as r = (rı — r2), then it is easy to 
see that # = —G(mi + m2)(r/r?); that is, the relative separation obeys the 
same equation as in the one-body problem. In the coordinate system with 
the center of mass at the origin, the two bodies move around the center of 
mass. 

Incidentally, this fact also shows that, in any planetary system, the 
planets and the central star are moving around the common center of mass. 
Hence, the motion of the planet will lead to a wobble of the central star 
thereby providing one possible way of detecting extra-solar planets — which 
is a prelude to close encounters of the appropriate kind! Since the orbital 
velocity of Earth (with mass Mẹ) around Sun (with mass Mo) is about 
v = 29.7 km s71, the ‘reflex’ velocity of a Sun-like star due to an earth-like 
planet orbiting at the Earth-Sun distance 1 AU = 1.5 x 101° cm (called the 
astronomical unit), will be V ~ (Mg/Mo)v ~ 9 cm s71 which, however, is 
quite tiny. If, instead, we had a planet like Jupiter (about 320 times more 
massive than Earth), orbiting at 1 AU from the star, then we can easily 
detect the wobble with current technology. The current detection limit of 
about 3 ms~? will make it possible to catch a planet with a mass of about 
33 Mg orbiting a 1 Mo star at 1 AU under best orbital inclination. (See 
exercise 1.7 for the effect of orbital plane inclination). 

An interesting special case of astrophysical significance arises when the 
two bodies move around the common center of mass in circular orbits. The 
angular velocity is then given by 2 = [G(m1 + m2) /a3]!/* with a denoting 
the separation between the objects. If we transform to a coordinate system 
rotating with this angular velocity Q, then the two bodies will be stationary 
in this frame. But now the effective potential ¢ at any point r in the z — y 
plane (which we take to be the plane of orbit) will have a contribution 
from the centrifugal force as well as the normal gravitational field. This 
net (effective) gravitational potential is given by 


jaa Us ee eS L, (1.17) 


The first two terms are the standard gravitational potential while the gra- 
dient of the last term gives the centrifugal force. The equilibrium points 
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(that is, points on the x — y plane at which the acceleration, g = —V¢, 
vanishes) in this potential are of considerable interest. This is because, if 
you keep a test particle at such a point, it will not move in this rotating 
frame. (In the original, inertial, frame the particle will co-rotate with the 
two bodies.) The condition that the acceleration should vanish at some 
point r is given by: 
Gm, 
> lr =P |3 


Porta) +02 =0. (1.18) 


(r-rı)-— r-r 


It is fairly easy to show that there will be 3 such points (called the Lagrange 
points Lı, Lo and L3) along the line joining the masses. But you can easily 
verify that equilibria at these points are unstable. Unfortunately, there is 
no simple formula for the location of Lı, L2, L3. (If you write down the 
relevant equations, you will find that it is fifth order which has no ready 
made formula for its solution.) One can, however, produce an approximate 
solution when, say, m2 < mı. Since mı is then the primary source of 
gravity, Lı must be quite close to mg for the two gravitational forces to be 
comparable. If we take the location of Lı as x = ag — R with R <a, you 
can solve for R to first order and obtain R = (m2/3m,)!/3a. This radius 
is called Roche radius or radius of Hill sphere (depending on whether you 
are talking to a stellar astronomer or a planetary person). It is obvious 
that, if a body of mass mz is held together by self gravity, then it will 
be torn apart (by the tidal force of mass mj) if its size is bigger than R. 
For example, consider a binary star system made of a normal star and a 
compact companion star. If the normal star expands beyond R some of its 
mass will flow on to the compact companion which is a process of great 
importance in several astrophysical sources (see Chapter 5). 

There are also two equilibrium points (L4, Ls) that are off azis which are 
of greater interest. To find such solutions, take the dot product and cross 
product of Eq. (1.18) with r and use the facts that rı = a1X, rg = —a2ę 
and m a, = mza2. The cross product gives the relation 


lr —~r,|=|r—re|=d (1.19) 


which shows that L4 and Ls must be on the line bisecting the line joining 
the two masses. If either of these points are at a distance y from the line 
joining the masses, then d = [y? + (a?/4)]}/2. Next, taking the dot product 
of Eq. (1.18) with r, dividing by r? and re-arranging the terms gives 





Clr Fa) _ Gms tia i G- 2). 
5 


a FE 1+ peg 7 (1 .20) 


Chapter 1: Gravitation 11 


Since m1aı = m2Q@2, those two terms cancel out giving the result d = a. 
That is, we conclude 


lr — rı| = |r — r2| =a. (1.21) 


This remarkable result shows that the two off axis equilibrium points form 
an equilateral triangle of side a with respect to m and mg! This result is 
independent of the ratio (m1/mz). It can be verified that while L1, L2, L3 
are unstable equilibrium points, L4, Ls are indeed stable equilibrium points 
provided the mass ratio is less than about (1/25). (It is not obvious since 
these points are actually local mazima of the potential. The reason is subtle 
and has to do with the Coriolis force.) 

In our solar system, taking Sun and the Jupiter as the two dominant 
masses, one can determine the location of the two points L4 and L5, which 
are called the Trojan points. The asteroids close to these points (called 
Trojan asteroids) rotate around in the solar system fairly stably, forming 
an equilateral triangle with Sun and Jupiter. 

Finally, we mention that even the unstable Lagrange points have their 
uses. Since the net acceleration is zero at, for example, L2, a spacecraft 
which is put there needs comparatively less course correction. This is one 
of the reasons why the satellites with Next Generation Space Telescope 
(NGST) or the Laser Interferometer Space Antenna (LISA, a satellite based 
gravity wave detector) are planned to be put on the Earth-Sun Lg point. 


1.2 Precession and tides 


So far we have treated all the bodies involved in the gravitational interac- 
tions as point particles. Let us next look at some “real life” effects that 
occur in solar system in which we have to recognize the fact that plan- 
ets and satellites are deformable bodies of finite size and not structure-less 
point particles. 

To the lowest order of approximation, Earth can be treated as a flattened 
spheroid with the deviation from the spherical shape attributed, in large 
part, to the rotation. To estimate the flattening, let us note that the 
acceleration due to gravity at the poles is gp ~ -GMg/ RZ where Rp is 
the polar radius while the acceleration at the equator is ge % —(GM@/R?2). 
(You will recall that the subscript ® stands for values pertaining to Earth.) 
The difference in these two accelerations will be of the order of centrifugal 
acceleration Q? Re where 2 is the angular velocity of rotation of earth and 
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Re is the equatorial radius. Assuming that Re — Rp = ÔR « Re (where 
Re is the mean radius), we get (GM@/R%,)(6R/Re) ~ 2?Re. Such a 
deviation is characterized by an eccentricity of 


ôR QR 1 
R GMs 300 





e 


(1.22) 


To make the numerical estimate, remember that 27(R3,/GMg)!/? = 1.41 
hours while 27/Q = 24 hours. (The former time scale occurs in different 
contexts; it is (i) the gravitational collapse time scale for matter with the 
density of earth as well as (ii) the orbital period of a satellite hovering just 
above earth. It is a good number to remember.) Of course, once the earth is 
taken to be non-spherical, the gravitational acceleration need to be worked 
out again in a self-consistent manner. This can be done and the e changes 
only by a factor (5/4). 

This flattening at the poles makes earth a solid body with two moments 
of inertia (about the principle axis) being nearly equal and the third one a 
little different from the other two: I; = Iy # I, with 6I/I = 6R/R = 1/300. 
This deviation in the shape of the earth from a perfect sphere leads to some 
interesting effects. 

To begin with, since Earth rotates with an angular velocity w; = 2 = 
7.3 x 1075 per second (corresponding to a period of 24 hours), its axis of 
rotation will undergo (what is called) a free precession. To obtain this result, 
we need to recall some basic facts from rotational dynamics. In an inertial 
frame, the angular momentum of the body changes in accordance with the 
law (dJ/dt) = K where K is the external torque. The time derivative of 
any vector A in the inertial and rotating frames differ by the term Q x A. 
This is most easily seen by noting that if the vector A does not change in 
the moving frame, then its rate of change in the fixed frame is only due to 
rotation and we must have dA = 66 x A = (Q x A)dt. Hence this term 
represents the difference in (dA /dt) as measured in the rotating and inertial 
frames. Therefore, the equation of motion for the angular momentum in 
the rotating frame is given by 


dJ 
—+OxJ=K. : 
Ti +Q x K (1.23) 


Taking the components of this equation along the principle axis of the 


Chapter 1: Gravitation 13 


system, we get 


dQ 
re + (B — 12)0203 = Kı, 


dQ. 
h= + (iy — I3)03Q) = Ko, 


dQ 
he + (Ig — 11) 409 = K3. (1.24) 


These are the Euler’s equations which govern the rotational motion of a rigid 
body. Consider the special case of free rotation (i.e. no external torque, 
K = 0) of a body with axial symmetry, for which J; = Iz 4 Iz and Qa = 
constant (as in the case of earth rotating on its axis). Then we see from 
Eq. (1.24) that Qı and Q2 satisfy the equations Q; = —wpM2, Qe = Wp 
where wp = 03(/3 — 1) /Ji is a constant. Multiplying one of the equations 
by i and adding to the second equation, one can easily solve the set and 
obtain 


Qı = Acoswpt; Q = Asin Wpt (1.25) 


where A = (2+43)!/? is a constant. The motion, therefore, consists of the 
vector Q precessing with an angular velocity wp around the symmetry axis 
of the body, while remaining constant in magnitude. In fact, the component 
along axis-3 as well as the magnitude of the projection of Q on to 1-2 plane 
remain separately constant during such precession. For earth, 


de T kga 5 x 2.4 x 1077. (1.26) 
This precession corresponds to a period of about 300 days which turns out 
to be about 50 per cent smaller than the observed value. It is believed that 
the discrepancy arises due to complicating factors like tidal distortion of 
the Earth. 

A more interesting phenomenon is the forced precession of the axis of 
rotation of the Earth, which arises due to the torque exerted by Sun (and 
Moon; we will concentrate on the Sun) on Earth. This precession rate can 
be estimated fairly easily. Since the precession rate is small — something 
which we will verify at the end of the calculation — we can adopt a geocen- 
tric view and think of Sun as going around the Earth, once every year. For 
time scales much larger than a year, this is equivalent to the mass of the 
Sun, Mo, being distributed around earth along a circular ring of radius do 
(equal to the mean earth-sun distance) as far as computation of the torque 
causing the precession is concerned. In this case, (J ) = 0 and Eq. (1.23) 





Wp = 
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gives, Q x J = K where K is the external torque. The rotation axis of 
earth is inclined at an angle 0 ~ 23.5 degrees to the normal to the plane 
containing the ring (see Fig. 1.2). A perfectly spherical earth, of course, will 
feel no torque from the ring; but since the earth has bulge around equator 
with equatorial and polar radii differing by 6R/R ~ 1/300, there will be a 
torque, as if earth is a small dumbbell with masses 4M ~ M (R/R) located 
at the ends of a rod of length 2R. This net force F is due to the difference 
in the gravitational force of the sun (treated as a ring around earth) on 
the two masses of the dumbbell; so F ~ (GM o5M/d3,)(2Rcos@) and the 
corresponding torque is K ~ F(2Rsin9). The rate of precession will be 
Q x K/Jsin@ (since Q x J ~ K from Eq. (1.23)) where J ~ (MR?)w is 
the angular momentum due to daily rotation of earth. Combining all these 
and using (GMj@/d3,) = v? where v is the frequency of revolution of earth 
around the sun, we get 
4v? AR 


Ss R cos 6; (1.27) 





2r Torvit \ (AR\~* 
Tpre = y T orbit (z) (=) secO ~ 3 x 104 years. (1.28) 


There is a comparable contribution from the moon going around the earth. 
The gravitational force of Sun on Earth is, of course, significantly larger 
than that of Moon on Earth. But the tidal force due to Sun and the Moon 
has the ratio 


(Mo/ds) (Mo/R3)(Ro/do)® 
(Mmoon/@oon) = (Mimoon/ R3con)(Rmoon/dmoon)® 
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where 6 = (R/d) is the angle subtended by Sun or Moon at Earth and we 
have used the fact 6g ~ moon. So adding the effects of both sun and moon 
reduces the period to about 2 x 104 years which is close to what is observed. 
This means that the axis of rotation of earth will slowly precess and what we 
call the pole star, for example, will be different over few thousands of years! 
The direction of the equinoxes, with respect to fixed stars, will also shift 
slowly at the same rate. If the axis of rotation precesses by 360 degrees 
in 2 x 10* years, it implies a shift of about 1 arcminute per year, which 
even the ancient civilizations could take note of. Obviously, astronomical 
observations, which routinely work at fraction of arcsec precision needs to 
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corrected for this and one usually specifies the coordinates of the objects 
in the sky at a fixed reference date (used to be 1950A.D and is changing to 
2000A.D). 





Mo 


M (R/R) 


Fig. 1.2 Simplified picture to derive the precession of the equinoxes (definitely not to 
scale !). 


The same effect will try to align the orbital plane of the moon (which 
is inclined at about 5 degrees) with the ecliptic. Similar analysis shows 
that the line of intersection of these two planes precesses with a period of 
about 18 years. Since the intersection of the the orbital plane of the moon 
with the ecliptic gives the directions near which eclipses occur, this effect 
is of considerable practical importance. The precession of the moon’s orbit 
plus its inclination implies that the torque it exerts on earth has small 
fluctuations. Consequently, the precessing path of earth’s axis has a tiny 
wobbling motion called nutation. Once again, this was known for centuries! 


Another phenomenon which arises because bodies are deformable is the 
tide raised on earth, which we shall consider next. Let us begin with the 
tidal effects due to the Moon. The action of Moon’s gravitational force 
deforms the surface of the Earth periodically thereby leading to tidal bulges 
on the surface of the Earth. These tidal bulges will be located directly 
under the Moon if: (i) there is no other force (like friction) acting on the 
system and (ii) the rotational period of earth is identical to the orbital 
period of moon around earth. In reality the surface deformation of earth is 
affected by fairly strong frictional forces and the rotation of earth is quite 
fast compared to the orbital rate of the moon around earth. Hence the tidal 
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bulge will not be aligned along the direction of the Moon. The fast rotation 
of the planet will drag the axis of the bulge ahead of the earth-moon line 
leading to several interesting consequences: 

This causes a torque between the Moon and the earth tending to align 
the axis of the bulge along the earth-moon line; an equal and opposite 
torque acts on the moon increasing its orbital angular momentum. The 
net effect is to decrease the angular velocity of earth (thereby lengthening 
the day) and increase the orbital angular momentum of moon which in 
turn, makes the earth-moon distance to increase (thereby lengthening the 
month). In fact study of fossilized corals show that the length of the month 
was shorter in the past; earth has indeed slowed down and moon has moved 
away. You can see that, eventually, earth will rotate once in its axis exactly 
in the time it takes for moon to complete one orbit around earth. Also, 
since circular orbits have the lowest energy for a given angular momentum, 
this process also leads to the circularisation of orbits. 

The final length of the day resulting from the tidal interaction (which 
will be the same as the final orbital period of Moon) can be easily estimated 
from the conservation of angular momentum. Let Mı, M2 be the masses 
of earth and moon, w; is the rotational speed of Earth, we is the orbital 
speed of Moon, wy is the final values for these two in synchronous rotation, 
R the distance between earth and moon and Ry is the corresponding final 
distance then the conservation of angular momentum gives 


Ty wy + Ip w2 + Maw R = hws + Iwys + Mgwy R3. (1.30) 


In arriving at this equation, we have ignored the spin angular momentum 
of Moon at the present epoch and the spin angular momenta of Earth and 
Moon in the final state; you can easily verify that these are ignorable. 
Further, orbital dynamics requires (Rf/R) = (w2/w1)?/3. Taking I) = 
(2/5)M, R? etc. we get 


3 
w2 2 Mı Ry Wt 
—=/14+-[([— — — = 1.99. 1.31 
W Ff b+ (4) (3 W2 (Eal) 
Hence the final day (and month) will be about 54 days and the final Moon’s 
orbital radius will be about 5.9 x 10° km. 
At present, the fractional increase in the distance to the moon is 1.3 x 
10—1° per year and the lengthening of the day is by 7 x 1074 sec per century. 


Precise calculation of the tidal friction, described above, suggests that — 
while it could be the dominant effect — there could be other contributions 
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to the lengthening of day etc. A completely satisfactory model, accounting 
for all the observations is not yet available. 

A corresponding effect, of course, occurs due to the tides produced on 
the moon by earth. If the initial position of the moon was closer to earth 
and its orbital period was different from the rotational period about its 
own axis, then the tides raised by earth on moon will have the effect of 
synchronizing the orbital and rotational periods of moon. Being a smaller 
body, this effect on moon occurs at a shorter time scale and has already 
caused the moon to be in such a synchronous, tidally locked orbit with 
respect to earth. In fact, most of the satellites in solar system have this 
synchronous feature. 

Let us consider some details of the tidal interaction between earth and 
moon using a fairly simple model. We will assume that earth-moon system 
goes around the common center of mass in circular orbits and work in a 
coordinate system that is co-rotating with them. Suppose that a point near 
the surface of earth has the coordinates (1,6) where l measures the radial 
distance from the center of the Earth and 6 measures the angle from the 
instantaneous earth-moon line. If Mı and M2 are the masses of earth and 
the moon, R is the distance between their centers, we can compute the 
effective gravitational potential at any given point with coordinates (r, 0) 
using Eq. (1.17). (Note that in this equation the r refers to the position 
of a particle with respect to the center of mass of the Earth-Moon system 
and not with respect to the center of the Earth.) The potential due to the 
Moon is given by 





M.: P ae 
See = n (: RT 25 cos8) (1.32) 
~ GM2 l 1 2 I? l 3 
amen: 1+ ges? + 5(3cos 0-1) + O R 








where we have done a Taylor expansion in l/R. (If you know your Legendre 
polynomials, you will see this immediately.) The term (1/R)cos@ varies 
linearly along the z = l cos0 direction from M; to Mg so that its gradient 
gives a constant force. This is precisely canceled by a corresponding term 
in the centrifugal force; which is, of course, necessary for the centrifugal 
force to keep the bodies in circular orbits. Noting that the center of mass 
(from which the vector r is measured) is at a distance M2R/(M, + Mo), 
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we can compute the centrifugal contribution to be 


M. 2 
1 2 2 iL 2 2 2 Mo 

z = — i¢ —2 | ———_ a'ei 
zer Ge le F” + Miia a.cos @ (1.33) 


Using w? = G(M, + M2)/R? we see that the term linear in cos@ indeed 
cancels. Adding the contribution due to Earth, the net potential near the 
surface of the Earth varies as 


GM, 3 GMoa? 


l 2 R 


The equi-potential surfaces, given by (1,0) = constant, will provide the 
lowest order deformation of the spherical shape of the earth due to tidal 
forces. To get a better feeling for this shape, one can set | = Rg +h (where 
Rọ is the radius of the Earth) and expand Ẹ(l,8) in a Taylor series in A. 
A simple calculation now gives, to the lowest non trivial order in h, the 
expression 








O(1,0) =— cos” 0 + constant. (1.34) 


3 
O(h, 0) xg |» - > (2) (=) Rg cos? o| + constant (1.35) 


Mı 
where g is the acceleration due to gravity on the surface of the earth and 
higher order terms in (Rg/R) and (h/R) are ignored. Setting © to constant 
shows that the deformation of the surface, h, is given by 


_ 3/M2\ (Re\°* 5 
A= 2 Ga) (42) Re cos* @. (1.36) 


For (M2/M,) ~ (1/80); Ro ~ 6400 km and R = 3.8 x 10° km, the difference 
between h(9 = 0) and A(O = 7/2) is about 54 cm. The corresponding 
calculation for Sun’s gravitational field on earth produces a shift of about 
25 cm. It should be noted that the tidal force has a cos? 0 = (1/2)(cos 20+1) 
dependence where @ is the angle measured from the location of the moon. 
This leads to a maximum at both 6 = 0 and 6 = a which is characteristic 
of a tidal force. 

The same formalism also allows you to obtain another interesting result. 
Consider a small body of mass m which is orbiting close to a large body 
of mass M. It is intuitively obvious that, if the small body gets too close 
to the large one, tidal force can rip it apart. How close can it get and 
still survive ? To determine this, align the z-axis along the line joining the 
centers of the bodies and consider the gravitational acceleration along the 
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z-axis. This is given by the gradient of the potential in Eq. (1.34), which 
we will now write, using z = 1cos@ and dropping the constant term, as: 


2 2 
oqo- Gm _ 3 GMP og Gm _ 3 GMz 





1 2 Ry 2 RS 





(1.37) 


for points along the z-axis. It is easy to see that the gradient gives g, = 
—d¢/dz = Gz[|(3M/R?) — (m/|z|8)|. The force in the first term is trying 
to rip the body apart while the second term is the restoring force due to 
self gravity. The stability of the body against tidal disruption requires 
(3M/R?) < m/|z|°. This allows us to define a critical radius Rroche = 
(3M/m)*/3|z| (which is exactly the result we obtained earlier from the 
study of Lagrange points; see the discussion after Eq. (1.18)}. Writing |z] 
as r and the mass of the satellite as m = 4rpr?/3, the condition for stability 
is R > Rroche where 


1/3 1/3 
Rroche = (=) zx 1.44 (=) Ry (1.38) 
Arp pı 


is the Roche radius and M = 47 pẹ R3/3 where p, and R, are the density and 
radius of the central object. (A more precise calculation changes the factor 
1.44 to about 2.9.) Of course, this limit really makes sense only when Rroche 
is larger than the radius R, of the massive object. As an example, consider 
a small body which gets close to the sun. The condition Rroche > Ro leads 
to the constraint p < (9/47)(Mo/R3) =% 4.2 gm cm~3, which holds for 
most objects in solar system. So any of them will be disrupted tidally if 
they get too close to the sun. 


1.3 Virial theorem 


When we move from 2, 3 or few body problem to the genuine many body 
problem in gravity, life becomes rather difficult. Usually, such a many 
body limit of a system can be understood using statistical physics. But 
conventional ideas of statistical mechanics do not work with gravity because 
conventional statistical mechanics assumes that energy is extensive. If you 
divide the molecules of a gas into two parts, the total energy is (to a very 
good approximation) the sum of the internal energies of the two individual 
parts. But if you divide a star cluster or galaxy into two parts, the total 
energy is not the sum of two parts; the gravitational interaction energy 
between the two parts makes a significant contribution. Molecules in a 
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gas interact through short range forces and hence the interaction between 
molecules in one bulk region with molecules in another region is essentially 
confined to those which are near the surface separating the two regions. 
Hence the ratio of interaction energy to bulk energy scales as the ratio of 
surface area to the volume, L?/L* ~ L~1 where L is a typical size of the 
region. In any large volume of sufficiently dilute gas, we can ignore the 
interaction energy and treat energy as an extensive variable. This does not 
work in the presence of gravity which is a long range interaction. 

There is, however, one beautiful result — called virial theorem — which 
can be proved in gravitational N-body problem. Consider a collection of 
particles with masses m; and positions r; interacting through gravity. Start- 
ing from the equation of motion for the i-th particle 


a Gmjmi(ri = rj) 
Mir; = > ir; mee F (1.39) 
jži 4 





we take the dot product of both sides with r; and sum over all i to get 
ù Gmymir; (ry rj) 
ome y E (1.40) 


Writing r; = (1/2) (r; — r;) + (1/2)(r; + r;) and using the antisymmetry of 
(r; — rj) under pair exchange, one can write this as 


rs G 
Y miri ñi = -3 S anpi Mami (1.41) 
P jx lr: — 5] = rj] 
where U is the gravitational potential energy. On the other hand, we can 
relate the left hand E of Eq. (1.41) to the second time derivative of the 
quantity I = | mirz. Direct algebra gives 

Co ae 2 a s ; 

2 az Se (1.42) 


where K is the kinetic energy of the system. This result, when combined 
with Eq. (1.41) leads to the virial theorem: 


1er 
2 dt? 
(Historically, the quantity }> F; - r; was called virial and hence the name). 


You might think this is rather useless since Ï is not a simple quantity 
to determine. The utility of virial theorem comes from the fact that we 


=U+2K. (1.43) 
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can handle it in two important cases. First, when the system is (at least 
approximately) in steady state we can set Ë = 0 thereby obtaining a relation 
between kinetic, potential and total energies: 

2K +U =0; E=-K= =U. (1.44) 
Second, suppose the system is (reasonably) bound so that the time average 
< I > is at least quasi-periodic making its time derivative vanish. Then, 
taking the time average of both sides of Eq. (1.43) we get 2(K) + (U) =0. 
In either context, we have an estimate of the relevant energies. 

One of the key applications of virial theorem is in the determination of 
masses of large groups and clusters of galaxies. For a group of N galaxies, 
treated as point masses with masses m;, positions r; and velocities v;, the 
virial theorem can be written as 


GMM; 
Yoro 5 ee rrr (1.45) 


tA 


Treating the quantities appearing in the above equation as approximately 
equal to the time-averages, we can use this relation to determine the total 
mass of the system. Observationally, we can only determine the line-of- 
sight velocity dispersion a? and the two dimensional projection R; of r; on 
to the plane of the sky. On the average, we expect v? = 307; the projected 
inverse separation can be related to the true inverse separation by averaging 
over all possible orientations of the group. It is easy to show that 


(IR: - Byl)™) = 5 (e = ry) (1.46) 


Finally, we will also assume that the mass-to-light ratio Q of all the galaxies 
is the same so that m; = QL; where L; is the luminosity of the i-th galaxy. 
Substituting m; = QL; in Eq. (1.45) and using Eq. (1.46) we get 


3m Fii Lio} 


Qa ig ete = 
2G Diy; LiL; (Ru — Ry|-?) 


(1.47) 


Since all the quantities on the right hand side can be determined from the 
observations, we can find Q for the galaxies in the cluster. This procedure 
will give reasonably reliable results for groups with sufficiently large number 
of galaxies. Several groups have been studied using the above equation and 
observations suggest a median value of Q ~ 260 Qo. This is one of the 
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key evidences for the existence of significant quantities of dark matter in 
virialized clusters and groups of galaxies. 

One curious consequence of the virial theorem is that sravitating sys- 
tems have negative specific heat. Recall that we can associate a temperature 
T to the kinetic energy of random motion K by K = (3/2)NkgT. The spe- 
cific heat is Cy = (dE/dT); since T œ K, the sign of Cy is same as that 
of (dE/dK) = —1 from virial theorem, making Cy < 0. This behaviour, of 
course, is well known even in the case of gravitational one body problem. 
A satellite in a circular orbit around Earth, when it loses energy, will go to 
an orbit with lower radius but higher orbital velocity. Loss of energy heats 
up a self gravitating system. 

The importance of this result arises from the fact that no system in 
canonical ensemble can have Cy < 0. To see why, recall that the mean 
energy of a system in canonical ensemble is given by 


BEX )= 7 Lee ef =-(nZ); Z= gees (1.48) 


where 8 = (kgT)~1 and the prime denotes the derivative with respect to 
B. It follows that 


Cy dE | Ke. Kozal ZE AZN 
ke dD -B E' = Pn Z)” = BP (F- (5) ) (1.49) 


But the (e?) = (Z"/Z) and (e)? = (Z2'/Z)? giving 





cy = Ple) — (e)) = (Ae)? > 0. (1.50) 


How does one reconcile this result with the fact that the specific heat of 
a self gravitating system is negative? The contradiction implies that the 
statistical description of a self gravitating system can never be based on 
canonical ensemble. It turns out that one can handle the situation using 
microcanonical ensemble for which Cy can be negative. The proof for the 
equivalence of the two ensembles — given in any text book on statistical 
mechanics -— uses the extensivity of energy which, as we noted before, no 
longer holds. 

In many astrophysical contexts, one has to deal with gaseous systems 
described by standard thermodynamics rather than point particles. To 
obtain the virial theorem for such systems, let us first recall some basic 
results for ideal gases with constant specific heats. For these systems, the 
definition Cy = dUint/dT integrates to Uint = CyT. For example, if the 
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gas has some f internal degrees of freedom — vibrational, rotational ete. 
— in addition to the 3 degrees of freedom for translational motion, then 
Vine = (3/2)NkpT[1+(f/3)| and Cy = (3/2)Nkg[1+(f/3)]. On the other 
hand, using 


dQ = Wint-+PdV = d(Uine +PV)—-VdP = CydT+NkgdT-VdP (1.51) 


we find that the specific heat at constant pressure, Cp = (dQ/dT)p = 
Cy + Nkg. The ratio of specific heats, y, then becomes 





Cp Nkpg + NkpT 


O Cy Vint mee) 


allowing us to write 
Uine = (¥-1)7 1 NkpT =(y-1)71PV; =P=(y-le (1.53) 


where € is the energy per unit volume. 

For such a system, we need to take into account the force due to pressure 
gradients while deriving the virial theorem. In this case the virial will pick 
up an additional contribution which is the sum of r;-F; = —r;-VP where 
P is the pressure. Writing 


ri: FE; =—r,:-VP=-V-(iP)+PV-1=-V-@iP)+3P (1.54) 


the summation over all particles can be thought of as an integration over 
the volume. The first term gives a surface contribution and we will take 
the surface to be one at constant pressure P,. Noting that the integral of 
r-n over the surface is just the volume integral of V - r, this term gives 
—3P,V where V is the volume of the system. In the second term we can 
use Eq. (1.53) to write P = (y—1)e. The integral of e over the volume will 
give Uint so that this term becomes 3(7 — 1)Uint. Then the virial theorem 
reads: 


2 
se = 2K + Ugr + 3(7 — 1)Uint — 3PsV. (1.55) 
Now K denotes the kinetic energy of bulk motion of gas (if any) while the 
kinetic energy due to random, thermal, motion is incorporated into Vint. 
Let us look at some consequences of this relation. 

First, consider the case in which P, = 0 and K = 0 (no bulk motion) 
which will represent a gaseous, self-gravitating system. Writing the total 
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energy of the system as E = Ug, + Uint, the virial theorem becomes 


1d?I 
5 ge 7 Ver + 3(7— DUm = 3(y- 1E — (87-4) ge (1-56) 
In steady state, the time derivative must vanish, implying 
37-4 
E = a Uer 1.57 
3(y-1) * (1.57) 


which gives a simple relation between the total energy and the gravitational 
potential energy Ugr. (Of course, Eq. (1.57) reduces to Eq. (1.44) when 
= 5/3 which corresponds to an ideal monatomic gas.) For a bound 
system, we require E < 0; since U,,r < 0 it is necessary that y > (4/3). The 
system will be unstable when y < (4/3). Some insight into the nature of 
instability can be obtained by rewriting the original Eq. (1.56) in the form 
7 — (4/3) (d?I/dt?) 


FET (1.58) 


When y > (4/3) and E < 0, the last term can vanish and the system can be 
in equilibrium. If some physical process now drives y towards (4/3) with E 
still negative, the last term cannot vanish and we must have 2E ~ (d?I/dt?) 
when y =~ (4/3). Since E < 0, this implies that (d?I/dt?) < 0. Usually 
such a condition would imply a collapse of the system. 

As a second example, consider an ideal monatomic gas, (with Uint = 
(3/2)kBT, y = 5/3) in the absence of bulk motion (K = 0) and in steady 
state (Ï = 0). The virial theorem now gives 3kgT + Uj, = 3P,V. Let us 
apply this relation to a (nearly) spherical configuration of fluid with mass 
M, radius R and temperature T in equilibrium with the gravity balanced by 
the pressure. The gravitational potential energy of such a configuration will 
be Usr = —a(GM?/R) where a is a numerical constant possibly of order 
unity. In terms of the one dimensional velocity dispersion u? = (kgT'/m) 
the virial theorem becomes 





1 (ae ~~) (1.59) 


sahe Re 
In the absence of gravity (G = 0), this equation gives P,V = Mu? which is 
merely the ideal gas law. In the presence of gravitational field, such a cloud 
will have a tendency to collapse under self gravity; so we expect P, to be 
reduced. Equation (1.59) shows that this is precisely what happens. As we 
increase the value of M, a point is reached when P, vanishes. For larger 
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values of M, the pressure P, will become negative indicating an instability 
in the system. The critical value of mass is given by 


_3(wWR\_ 3 /kBTR 
Merit = Gi (=) = oe ( mG ) . (1.60) 


Clouds of a given size R and temperature T will collapse under self gravity 
if M > Mert(T, R). This critical mass is usually called the Jean’s mass. 
The instability of systems with M > Merit is the key to understanding both 
star formation and galaxy formation, as we shall see in later chapters. 





1.4 Gravitational collisions and relaxation 


There is another significant aspect in which self-gravitating systems differ 
from more familiar gaseous systems. In a gas, at normal temperatures 
and pressures, collisions are very frequent. That is, the mean free path 
due to molecular collisions is small compared to the characteristic size of 
the system. The situation is very different for self gravitating systems. The 
exchange of energy between particles due to gravitational scattering off each 
other is a very slow process. Hence this process cannot relax the system, in 
contrast to molecular scattering in a normal gas. Let us estimate the time 
scale for this effect. 

Consider a gravitational encounter between two bodies in an N-body 
system (say, a cluster of stars) each of mass m and relative velocity v. If 
the impact parameter is b, then the typical transverse velocity induced by 
the encounter is du, ~ (Gm/b*) (2b/v) = (2Gm/bv). The deflection of 
the stars can be significant if (6v,/v) Z 1, which occurs for collisions with 
impact parameter b < be ~ (Gm/v?). Thus the effective cross section for 
collision with significant change of momentum is o ~ b? ~ (G?m?/v‘). 
The collisional relaxation time-scale will be 

1 R3y3 N Rv’ 


tg > — x 


(nov) N (Gm?) ~~ GM?" (1.61) 


If the system is virialized, then (GM/R) ~ v? and we can write 


aan Vc 8), a. 


Thus the collisional time-scale is N times the “crossing time” (R/v) for 
the system. For most gravitational systems, (R/v) will be comparable to 
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the dynamical time scale while N will be a large number making tge much 
larger than the time scales of interest. 

There is, however, another effect which operates at a slightly shorter 
time-scale. When the impact parameter b is far larger than be, the critical 
impact parameter be ~ (Gm/v*) obtained above, individual collisions still 
impart a small transverse velocity (6v1) ~ (Gm/bv) to a given star. The 
effect of a large number of such collisions is to make the star perform a 
random walk in the velocity space. The net mean-square-velocity induced 
by collisions with impact parameters in the range (b,b+ db) in a time in- 
terval At will be the product of mean number of scatterings in time (At) 


and (6v,)?. The former is given by the number of scatterers in the volume 
(2b db)(vAt). Hence 


((6v,)”) = (2mbdb) (vAt) n (5) (1.63) 


where n is the number density of scatterers. The total mean-square trans- 
verse velocity due to all stars is found by integrating over b within some 
range (b1, b2): 


2 n b2 G2m?2 
((6v1)")totar ~ At [ (2mbdb) (un) (S) 


2m2 
SAME Ajin (2) . (1.64) 
Vv by 


The logarithmic factor shows that we cannot take bı = 0, b2 = oo and one 
needs to use some physical criteria to fix bı and bg. It is reasonable to 
take b2 œ~ R, the size of the system; As regards bı, notice that the effect 
discussed earlier in Eq. (1.62) becomes important for b < be. So we may take 
by = be ~ (Gm/v?) . Then (b2/b1) ~ (Ru?/Gm) = N (Rv?/GM) ~ N for 
a system in virial equilibrium. This effect is important over time-scales 
(At) which is long enough to make ((6v1)”) total = v2. Using this condition 
and solving for (At) we get: 


v? N R tac 
oie rG mnn N ~ (sar) (=) E (sity) Zo) 


which is a shorter time-scale compared to tg, obtained in Eq. (1.62). The 
numerical factor arises from a more precise calculation and the above ex- 
pression gives the correct order of magnitude for the gravitational relaxation 
time of several systems. 
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For most astrophysical systems this time scale is very large compared to, 
say, the age of the universe. For example, a typical galaxy has R % 10 kpc, 
N = 10", v = 300 km s~}, giving tg. & 3x 10!8 yrs and (At) go. ~ 1.2x 1017 
yrs which are much longer than the age of the universe ty œ 10!° yrs. 
Among the systems of astrophysical interest, only globular clusters — which 
are gravitationally bound systems of about 10° stars — have a relaxation 
time (of about 0.4 Gyr) which is about a tenth of their age; one therefore 
expects to see the effects of gravitational relaxation in these systems. For 
all other systems like galaxies, clusters of galaxies etc., the relaxation time 
is much larger than the age of the system, thereby making it difficult to 
observe any effects of relaxation. 

Incidentally, the above result depends crucially both on the dimension 
of the space and the nature of the force law. If the analysis is repeated in 
a D-dimensional space with a force law which varies as r—” then we will 
need to evaluate the integral of b?-?” over all impact parameters in the 
range bmin < b < bmaz. In 3-dimension (D = 3), the integral converges 
on the upper limit for n > 2 and gives a logarithmic divergence for n = 2. 
This again shows that distant encounters are important for gravity (n = 2) 
but not for short range forces (n > 3). On the other hand, a strictly 2- 
dimensional system (like a disk of self gravitating matter) interacting via 
inverse square gravitational force (D = 2,n = 2) will have a relaxation time 
(R/v) which is the same as crossing time. Thus strictly two dimensional 
mass distributions can relax through gravity within a few crossing times. 

Our analysis also allows us to estimate the time scale for another closely 
related effect. Consider a test body of mass M; moving through a cluster 
of stars, say, where each “field” star has a mass ms. The energy exchanged 
between the test mass and the field stars is again governed by the same 
time scale as the one estimated above in Eq. (1.65), with Gm? replaced 
by GMimys. If the field stars were moving with typical random velocities 
larger than that of the test mass, then the net effect of the stars on the 
test mass will be to exert a drag force on the mass. This process — called 
dynamical friction — has the time scale 


ys 
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Note that the result only depends on the background mass density pf = 


Tdyn.fri = 
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nmp rather than on the mass my. A body moving through a dark matter 
halo of density p will have the same Tayn.fri irrespective of whether the halo is 
made of elementary particles (like neutrinos) or planet like objects with the 
mass of Jupiter. This process can be of importance in many astrophysical 
contexts since the time scale is now smaller by the factor (m s/M} compared 
to tgc. The motion of stars in the central regions of galaxies, the motion of 
clusters etc. are all affected by this phenomenon (see eg., Exercises (8.1), 
(8.14)). 

Finally, we must mention another peculiar phenomena which takes place 
in a system containing a large number of particles interacting through grav- 
itational forces. Such particles can relax in a time scale which is typically 
a few times the dynamical time scale by a process known as violent relaz- 
ation. This works by exchange of energy in a time dependent potential as 
well as through mixing of groups of particles in phase space. The details of 
this process are fairly complex but there is no doubt that this is the most 
rapid process which works among self gravitating system of particles allow- 
ing them to relax to a configuration which is approximately Maxwellian in 
the velocity distribution. The importance of this process lies in the fact 
that it operates at a time scale which is just a factor of few larger than the 
dynamical time scale. 


1.5 Relativistic gravity 


So far, we have discussed gravity in the Newtonian framework. There are 
several astrophysical applications in which one needs to go beyond this and 
use concepts from general relativity, which we will briefly introduce in this 
section. 

We developed the Newtonian gravity starting with the notion of gravita- 
tional potential ¢ related to the mass density p by Eq. (1.1). With suitable 
boundary conditions, this equation is equivalent to V7¢(t,x) = 4mGp(t, x). 
This equation shows that if you move a mass near the origin slightly, its 
gravitational field at a distant point should change instantaneously. This, 
of course, does not agree with the basic notion in special relativity which 
requires signals to propagate at most with the speed of light. It is clear 
that — at the very least — we need to tinker with this equation in order to 
get a theory of gravity which is consistent with special relativity. Actually, 
the situation is much more complicated than that, as we shall see. 

The peculiar features arise from the principle of equivalence which we 
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mentioned in page 2. Consider a region of space (say, near the surface 
of Earth), in which we can take the gravitational acceleration g = gx 
to be uniform and constant. A non relativistic particle moving in this 
gravitational field along the x-axis will have a trajectory x = (1/2)gt? if 
it starts at rest from the origin at t = 0. This trajectory is completely 
independent of any of the properties of the particle, in particular its mass. 
Two particles of masses m and m’ would have followed the same trajectory 
if the initial conditions were the same. 

Consider now a different situation. Suppose there are two particles 
of masses m and m’ at rest in some frame of reference S. Let us now 
view these particles from another frame of reference (5) — say, a moving 
car — which is moving with an acceleration (—g) along the x-axis. The 
coordinates in $ and S are related by Z = x + (1/2)gt?. Hence, viewed 
from the moving frame, a particle which is located at the origin of S' will be 
given the trajectories Z = (1/2)gt?. Once again, both the masses m and m’ 
will have the same trajectory because the “motion” of the particle arises 
merely due to coordinate transformation between S and S’. This simple 
example shows that the motion of particles under Newtonian gravity can 
be mimicked by the motion viewed from an accelerated frame, as long as 
we can treat g as a constant. 

Let us now push this analogy further. Consider a clock which is attached 
to the rim of a disk of radius r which is rotating with an angular velocity w 
about an axis perpendicular to the disk through the center. The speed of 
the rotating clock, v = wr, remains constant as it goes around. If we now 
compare the reading of this clock, T(r), with that of another clock located 
at the center of the disk, T(0), we will find that they differ due to special 
relativistic time dilation: 


T(r) =T(O) V1 — (v2/c?) = T(0) V1 — (w?r2/c?). (1.67) 


The rotating clock also experiences an acceleration g = w?r which is in- 
dependent of any property of the clock. If we enclose the clock in a small 
box, then any mass inside the box will experience the same acceleration. 
Since the acceleration is independent of the mass of the particles, one could 
attribute this acceleration to a gravitational field generated by a potential 
¢(r) such that —0¢/dr = w?r. This gives ¢(r) = —(1/2)w?r?, allowing us 
to write 


T(r) = T(0)V/1 — (w?r?/c?) = T(0) V1 + (2¢(r)/c?). (1.68) 
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This suggests that the rate of flow of time as measured by clocks is affected 
by the presence of gravitational potential! In special relativity, two clocks 
which are at rest with respect to each other will run at the same rate. 
If our result is correct, then a clock kept on a table (in a gravitational 
field) will run at a rate different from a clock kept on the ground since the 
gravitational potential felt by the two clocks are different. 

Another argument leading to the same dramatic result, based on special 
relativistic dynamics, can be provided along the following lines. The La- 
grangian for a non relativistic free particle is given by ZL = (1/2)mv? while 
that for a particle in a gravitational potential ¢ will have an additional 
piece (—m¢) incorporating the contribution from the potential energy. In 
special relativity, the corresponding action for a free particle is given by 


2 
A= -mè f anfi — 5 = -me f ds (1.69) 


ds? = edt? — dx? (1.70) 


where 


is the spacetime interval. Since ds/c denotes the proper time for the parti- 
cle, one can introduce the gravitational potential into special relativity by 
adding an extra term to the above action, leading to: 


A= -me f ds — T fods = -me f ds(a + $). (1.71) 


This expression is Lorentz invariant if we treat ọ as a scalar. Further, 
m appears as an overall multiplicative constant whìch does not affect the 
equations of motion for a particle, thereby ensuring the validity of principle 
of equivalence. (Of course, this action also has the correct non relativistic 


limit.) Writing ds = cdt,/1 — (v?/c?) and using 
2 442 v? ¢\? 242 2 142 
cdt h — z (1 + 5) x edt? — vdt? + 2¢dt? + O(c?) (1.72) 


we notice that our action in Eq. (1.71) can be written in a much more 


meaningful way for ọ < c?: 
/ v2 $ 


= —(me) / (1 + =) edt? — ax] if (1.73) 


ASG J (cdt) 
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This form suggests that one possible effect of gravity (when ¢ < c?) is to 
modify the line interval in the spacetime to the form 


2 j 
ds? = (: + =) edt? — dx? = gindx'da*. (1.74) 


The second equation defines the symmetric metric tensor gap with goo = 
(1+ 2¢/c?) and gag = —dag and we have assumed the Einstein summa- 
tion convention (one of the most important contributions Einstein made to 
science!): Any index repeated in an expression is summed over. Here i,k 
are summed over 0,1, 2,3. Given this modification for the form of ds?, the 
action for the particle can still be taken to be that of a “free” particle in 
Eq. (1.69) with a different form for ds. 

This is a truly profound conclusion for several reasons. To begin with, 
it ties up nicely with the result of our clock experiment. Suppose a clock 
is at rest (dx = 0) in some coordinate system which has a gravitational 
potential ¢. The proper time recorded by the clock is now given by the 
Eq. (1.74) to be ds = dt,/1 + 2¢/c? which is exactly what we found earlier 
[see Eq. (1.68)]. Gravity does affect the rate of flow of clocks in precisely 
the manner we originally asserted. Second, the above modification of the 
line interval from that in Eq. (1.70) to the one in Eq. (1.74) shows that the 
particle still moves from event to event extremizing the spacetime interval. 
That is, the trajectory of a particle is a curve of extremum length (called 
geodesic) in the spacetime even in the presence of gravitational field. All 
that has happened is that the underlying distances in spacetimes are mod- 
ified due to gravity. Third, the modification of the line interval into a form 
like Eq. (1.74) makes the spacetime curved. This opens up a completely 
new way of interpreting gravity — not as a force but as curvature of space- 
time. All this is possible mainly because of principle of equivalence which 
allows one to state that the effect of gravity on a particle is independent of 
the properties of the particle. 

The above result also blurs the distinction between different coordinate 
systems which one might use in describing physics. In special relativity, one 
gives preference to inertial frames of reference in which the spacetime inter- 
val has the form in Eq. (1.70). The Lorentz transformations preserve the 
form of this line interval and the class of all coordinate systems connected 
to each other by Lorentz transformations enjoy a special status in special 
relativity. But since gravity is locally indistinguishable from an accelerated 
frame, we no longer need to confine our attention to frames of reference 
which are moving with uniform velocity with respect to each other. Once 
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we introduce coordinate transformations connecting frames which are ac- 
celerated with respect to each other, the line interval no longer retains the 
simple form in Eq. (1.70). We now need to study more general line intervals 
in which the coefficients are functions of space and time. The expression for 
the line interval in Eq. (1.74) emphasizes the same point. In general, both 
the existence of genuine gravitational field as well as the use of accelerated 
coordinate systems in an inertial frames will lead to modification of the line 
element and one no longer retains a special status to any particular class 
of coordinate systems. 

Another general result which follows from our discussion is that the 
frequency of radiation will change when it propagates from event to event in 
a curved spacetime, since the rate of clocks at different points are different. 
In the case of a static metric, the proper rate of flow of clock at a point x 
is given by \/goodt. Since the frequency is inversely related to the period, 
it follows that 


w(x) _ ygol) 
w(x’) \/goo(x) 


If we assume that goo * 1 at very large distances from a mass distribution, 
then the frequency of radiation measured by an observer at infinity (woo) 
will be related to the frequency of radiation emitted at some point x by 
Woo = W(X) y/Go0(x). 

So far we have assumed that ¢ < c”. In this limit, only the coefficient of 
dt? gets modified due to the presence of gravity and one can still describe 
the gravitational field in terms of just one potential ¢. In general, all 
the coefficients in the quadratic expression for ds? will get modified and 
the gravitational field will be described by the symmetric metric tensor 
Jab(= Jba) which has 10 independent components. One of the simplest 
situations arises in the case of spacetime outside a spherically symmetric 
distribution of matter with total mass M. The metric for this case can be 
obtained in a heuristic manner, which we will now describe. 

Let us consider a point P at a distance r from the origin where the 
gravitational potential is ¢y = —GM/r according to Newtonian theory. If 
we consider a small box around P which is freely falling towards the origin, 
then the metric in the coordinates used by a freely falling observer in the 
box will be just that of special relativity: 


(1.75) 


ds? = c*dt?, — dr? (1.76) 


in’ 


This is because, in the freely falling frame, the observer is weight-less and 
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there is no effective gravity. (The subscript ‘in’ is for inertial frame.) 
Let us now transform the coordinates from the inertial frame to a frame 
(T,r) which will be used by observers who are at rest around the point 
P. Since the freely falling frame will be moving with a radial velocity 
v(r) = —f,/2GM/r around P (if it starts at rest at infinity), the required 
(non-relativistic!) transformations are: dti, = dT, drin = dr — vdT. Sub- 
stituting these in Eq. (1.76), we find the metric in the new coordinates to 
be 





ds? = h- ae 5 dT? — 2,/(2GM/r)drdT — dr. (1.77) 


Incredibly enough, this turns out to be the correct. metric describing the 
spacetime around a spherically symmetric mass distribution of total mass 
M! It will be nicer to have the metric in diagonal form. This can be done 
by making a coordinate transformation of the time coordinate (from T to 
t) in order to eliminate the off-diagonal term. We look for a transformation 
of the form T = t+ J(r) with some function J(r). This is equivalent 
to taking dT = dt + K(r)dr with K = dJ/dr. Substituting for dT in 
Eq. (1.77) we find that the off-diagonal term is eliminated if we choose 
K(r) = /2GM/c!r(1—2GM/c?r)~!. In this case, the new time coordinate 


is: 
JQ@GM/r) 
ame far+s far Ee 


8GMr _4GM nh! | 
= 








anh 


= cT — — 
c? c cr 


(1.78) 





This leads to the line element 


-1 
ds* = =(1 — =) edt? — (1 — | dr? — r? (d6? + sin? 6d¢”) 
cr cr 

(1.79) 

called the Schwarzschild metric, which is used extensively both in the study 
of general relativistic corrections to motion in the solar system and in black 
hole physics. It is clear from the form of Eq. (1.79) that something peculiar 
is happening at the radius rs = (2GM/c?) called the Schwarzschild radius. 
It turns out that no signal can propagate from r < rg to r > rg — which 
earns the surface r = rg the name event horizon. We shall say more about 
this in chapter 5, Sec. 5.4 while discussing black holes. Of course, if the 
body has a radius R greater than rg, then the metric in Eq. (1.79) is valid 
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only for r > R. Such a metric could still describe the geometry around, 
say, Sun. 


1.6 Gravitational lensing 


Another key feature of general relativity is that it affects the trajectory of 
light rays. Once again, the basic idea can be understood from the equiva- 
lence principle. Consider a cubical box of size | which is being accelerated 
upwards with an acceleration g. Let a light source on one wall of the box 
emit a pulse of light towards the opposite wall. It will take a time of about 
t = l/c to reach the opposite wall during which time the wall would have 
moved up by an amount x = (1/2)gt? = (1/2)(gl2/c?). An observer inside 
the box will see that the light beam has been deflected downwards by an 
angle 6 = x/l = gl/2c?. The observer will interpret this deflection as due 
to the effect of gravity on the light beam and will estimate it as 


Ox fat. (1.80) 


If one uses this formula for a light beam which is grazing the surface of 
Sun, the total deflection it would suffer would be about GMo/c?Ro. (In 
general, we expect a bending angle of the order of (¢/c?) in a gravitational 
potential ¢.) For a sun like star, GMo/c?Ro ~ Rs/Rọ ~ few arc sec. 
This deflection is huge, in spite of general relativists sometimes claiming 
it is small; if you don’t believe that, ask any radio astronomer who works 
with milliarcsec accuracy for sources in the sky. 

To get the details and the numerical factors right, one can proceed as 
follows. We first recall that when the light propagates in a media with a 
spatially varying refractive index n(x), its trajectory can be determined 
from the Fermat’s principle, which extremizes the time of travel 


where dl is the spatial line element and we follow the convention that the 
Greek indices run over just 1, 2,3. Extremizing this integral gives the equa- 


tion of motion 
d (p) _ on ioe 
d\ a) dae" (1:82) 
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On using Eq. (1.81) again in the form, dl = cdr/n, this equation becomes 


1 d / 2dz*\_ dinn 
2 dr (» =) Oxo" 

Our aim now is to approximate a weak gravitational field as a medium 
with a spatially varying refractive index. To do this, we note that, in the 
metric in Eq. (1.79), the proper time is dr = dt(1 + 2¢/c?)!/2, while the 
proper length is dl = dr(1 + 2¢/c?)-1/? giving 


dl _ dr dr 


g ltt 26/88 S gl 24e) (1.84) 


Assuming that light always travels with the speed c in proper coordinates 
we have dl/dr = c and taking v = (dr/dt), one gets the effective refractive 
index to be n = c/v ~ (1 — 2¢/c?). This can be interpreted as the effective 
refractive index of a spacetime with the gravitational potential ¢. When 
this is used in Eq. (1.83) to the lowest order in ¢/c?, we have the equation 
of motion for the light ray to be 

dae O¢ 

Pru -2a (1.85) 
Thus the net force acting on a light beam in a gravitational field is twice 
the Newtonian force. Though we used the form of the metric in Eq. (1.79) 
to get this result, it happens to be of general validity to the lowest order in 
p/e. 

Let us estimate the deflection of light d due to a bounded density dis- 

tribution p(x) producing a gravitational potential ¢(x). If a light ray is 
moving along the z—axis, it will experience a total transverse angular de- 


flection by the amount 
00 QV 
0 = 2f dz | z t| (1.86) 


—00 





(1.83) 





where V = (8/ðx,ð/ðy) is the gradient in the z — y plane. Such a 
deflection, of course, can change the apparent position of the source in the 
sky, distort the shape of an extended object, produce multiple images of 
the same source etc.. All these effects are collectively called gravitational 
lensing and each of them is a powerful tool in observational astronomy. We 
will now discuss some key features of gravitational lensing. 

To do this, we shall assume that all the deflection takes place on a sin- 
gle plane which we take to be the plane of the deflecting mass — usually 
called the lens —- and project the image position and the source position on 
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to the lens plane. The distances between source and observer, source and 
lens, and observer and lens are give by Dos, Dis, Dot, respectively. From 
Fig. 1.3, it is clear that 0, = 6; — (Dis/Dos)@. Let s and i denote the two 
dimensional vectors giving the source and image positions projected on the 
two dimensional lens plane; and let d(i) be the (vectorial) deflection pro- 
duced by the lens corresponding to the angular deflection 0. (See Fig.1.3). 
Then, the relation 6, = 0; — (Dis/Dos)é leads to the vector relation: 








Dis 
ey emia 1.87 
s=i- Pag) (1.87) 
ote Dos = 
Dis Dou a 


Dogê®i 








Observer 





Fig. 1.3 The geometry for gravitational lensing. 


Clearly, all the physics is in the deflection vector d(i). Consider now the 
2—dimensional divergence (?)V - d of this vector. Using 


Oo Org 


(726 = V2o— au = AOE a5 (1.88) 


and Eq. (1.86), we get 


2D 
QV7.d= oe dz [cpe rac? 2$] - OTOL yg ,y) (1.89) 
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where X(x, y) is the surface mass density corresponding to p(x) obtained 
by integrating p(x,y,z) along the z—axis. [The 07¢/0z? term vanishes 
on integration]. This two-dimensional Poisson equation has the standard 
solution: 





TE 12o J d?23(x) = ai (1.90) 


which gives the deflection d in terms of the surface density U(x). This 
completely solves the problem. 

As a simple application, consider the lensing due to a point mass, for 
which S(x,y) = M6(x)d(y); Eq. (1.90) then gives the deflection d(i) = 
(4GM Dot /c?)(i/i?). Substituting into Eq. (1.87), we find 


s=ih-5]; r? = 4GMDisDor 


D. (1.91) 


This result shows that s and i are in the same direction and their magnitudes 
are related by s = i — (L?/i). Solving this quadratic equation, we find the 
two solutions for the image position i to be 


a ; (s + Vs? + 41?) l (1.92) 


Thus any source produces two images when lensed by a point mass with 
the two images located on either side of the source. You can also see that 
one image is inside a circular ring of radius L (called Einstein ring) and the 
other outside. As the source moves away from the lens (ie., as s increases) 
one of the images approaches the lens (and becomes fainter, as we shall 
see below) while the other approaches the true position of the source (and 
tends towards a magnification of unity). 

In the cosmological context in which lensing often occurs, a foreground 
galaxy (or a cluster) will act as a lens for a background quasar (or a galaxy). 
In this case, all relevant distances can be taken to be D = 3000 Mpc (see 
Chapter 6; 1 Mpc ~ 3 x 1074 cm). Then the angular size of this Einstein 
ring will be 


L 4QM\ 1/2 M 1/2 D -1/2 
ôL = D” (SS ) =~ 2 arc sec (sa) (a) : 
(1 


93) 
One can also think of 6; as the typical angular separation between images 
produced by lens of mass M located at cosmological distances. For galaxy 
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with M ~ 10!2Mo, we have 6 ~ 2 arcsec, while for a cluster M ~ 10!° Mo 
and r = 1 arc arcmin. 

Even when the lensing action does not produce multiple images, it can 
significantly distort the image of an extended object. (It is usual to call 
lensing action which produces multiple images as strong lensing and the one 
which merely distorts the background image as weak lensing.) Consider, 
for example, the lensing action of a foreground cluster of galaxies on several 
distant background galaxies. The cluster will distort the background images 
of the galaxies and will shear them in the tangential direction. Mapping 
this distortion allows you — among other things — to determine the mass 
of the foreground cluster which is acting as a lens. Since all the gravitating 
matter contributes to the lensing action, this approach provides a direct 
measurement of the total gravitating matter in the cluster including the 
dark matter. Such observations have indicated that there is nearly few 
hundred times more dark matter in clusters of galaxies than visible matter. 

This lensing action can also lead to magnification of the source. This 
magnification can be calculated by considering the change in the image 
position ĝi for a small change in the source position ds. Since all the photons 
from a small area element As around s reach another small area element 
Ai around i, the magnification A = (Ai/As) will be the determinant of the 
matrix of the transformation between the two: 


Dia = ace 


asp 
The amplification of the two images in Eq. (1.92), for example, are given 
by 


dsp 


A=d 
et Dia 


lis (1.94) 














-1 
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A+ =]|l-{>— = — = t- 1.95 
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where u = s/L. It is clear that the magnification of the image inside the 

Einstein ring is negative. The total magnification of the flux obtained by 

adding the absolute value of the two magnifications is given by 





u? +2 
uvu? +4 
An important application of the lensing by a point mass is in a phe- 


nomenon called microlensing. If, for example, a compact object in our 
galaxy (say a Jupiter like non luminous object) moves across the line of 


Atot = Ay + A_ = (1.96) 
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sight to a distant star in, say, the Large Magellanic Cloud (LMC, which 
is a small satellite galaxy of Milky Way), it will lead to a magnification of 
the light from the star under optimal conditions. The peak magnification 
in Eq. (1.96) is at u = 1 is Amax = 1.34 which should be easily detectable. 
This is an indirect way of detecting massive, compact halo objects (called 
MACHOS) in the halo of our galaxy. The typical time scale for a microlens- 
ing event is 


t= Date -oam (Ge) (oes) (G8) Geom) 

v l Mo 10 kpc Dos 200 km s+ 
(1.97 
In the case of MACHOS located in our galactic halo acting as lenses for 
sources in the LMC, we need to sample the light curve of stars in LMC 
for between about an hour and a year to detect lenses with the mass range 
of (107° — 10)Mo. The probability for such lensing action, of course, is 
low and hence it is necessary to monitor a large number of stars for a long 
period of time in order to produce observable effects. If the number density 
of lenses is n and if we take the typical cross section of each lens to be 70? 
(see Eq. (1.93)) then the probability of lensing over a distance D will be 
P ~ (nD3)(6?) ~ GpD?/c*. (Incidentally, this depends only on the mass 
density of the lenses and not on the actual masses.) For lensing of stars in 
LMC by compact objects in the halo of our galaxy, we may assume that all 
the dark matter density in the galactic halo is contributed by these lensing 
objects. Then we can use p = 107g cm~’, D = 500 kpc; this gives a 
rather small value of P œ 1078. You are saved by the fact that there are 
about 5 x 10° stars in the LMC and a large fraction of them can be studied 
systematically. 





Exercises 


Exercise 1.1 Shapes don’t matter: 


Compute the gravitational potential energy of a sphere of radius R and mass M, 
if it has a constant density. Now suppose that the same mass M is flattened to 
a thin disk of radius R with constant surface density. Compute the the potential 
energy of the new configuration. What is the moral of the story? [Hint: For a 
sphere U = (3/5)GM?/R while for a disk it is (8/37)GM?/R. Under extreme 
flattening, the numerical coefficient changes only by a factor 1.41.] 
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Exercise 1.2 Going full circle: 


Can you find a physical situation in which the minor arc AC of Fig.1.1 could be 
meaningful? [Hint: Repulsive thoughts might help.] 


Exercise 1.3 Dynamic ranges for physical parameters: 


Compute the characteristic speeds [using the v? ~ GM/R] for (a) planets in 
solar system; (b) stars in a galaxy and (c) galaxies in the universe. What is 
the dynamic range in the variation of mass, radius and density in these systems 
compared to the dynamic range in speed ? What do you conclude ? 


Exercise 1.4 Estimates for solar system: 


(a) Where is the center of mass of the solar system? (b) Where is the center 
of mass of the Earth-Moon system? (c) What is Jupiter’s orbital angular mo- 
mentum? (d) What is the rotational angular momentum of the Sun (which is 
rotating about its axis with a period of 25 days)? (e) What should be the speed 
with which an object should be launched to escape the gravitational force of the 
solar system? What if we want to escape the Milky Way galaxy? 


Exercise 1.5 Exact solution with (1/r?) term: 


Solve the central force problem with the potential V = —(GMm/r) — (8/r’) 
exactly and show that the bound orbits are of the form (1/r) = Cı + C2 cos/[(1 — 
(28m/J?))\/?0] where Ci,C2 are constants. Hence show that when the G/r? 
term is a small perturbation to the (1/r) potential, the orbit precesses at the rate 
given by Eq. (1.16). [Hint: In any central force problem, the angular momentum 
gives a term proportional to J?/r?; hence the (1/r”) correction to potential only 
“renormalizes” the J? term.] 


Exercise 1.6 Basic measurements: 


Some great moments in early astronomy occurred when different civilizations 
managed to measure the astronomical distances of nearby objects. This exercise 
explores three of them: (a) Ancient astronomers knew that lunar eclipse occurs 
when the shadow of the Earth falls on the Moon. Assume that Sun is very far 
away from the Earth. Show how one can determine the ratio of the diameter of 
the Moon to the diameter of the Earth from the duration of the lunar eclipse. 
Given the diameter of the Moon and its angular size in the sky, we can determine 
the Earth-Moon distance in terms of the Earth diameter. (b) At noon on the 
summer solstice, the Sun is seen overhead at Syene, a city almost directly south 
of Alexandria. (This is known from the fact that at that time, buildings and 
poles cast virtually no shadows.) At the same moment, in Alexandria, a vertical 
rod AB casts a shadow AC on the ground such that angle ABC is 7.2 degrees. 
Show how this measurement can be used to determine the diameter of the Earth 
given the distance between Syene and Alexandria. (This distance was recorded 


Chapter 1: Gravitation Al 


to be 5000 stadia; there is some confusion as to how much is 1 stadia but if it is 
157.2 m, as some scholars believe, you get the correct result.) Using the result 
in (a), one can now determine the Earth-Moon distance. (c) A half Moon occurs 
when the Sun(S), Moon(M) and the Earth(E) are at the vertices of a right angled 
triangle SEM. By measuring angle SEM (which is about 89.85 degrees!) show 
how one can determine distance between Earth and Sun. 


Exercise 1.7 Kepler motion of a binary system: 


Consider two stars of masses mı and mz orbiting around the common center 
of mass (in an elliptical path) under their mutual gravitational attraction. The 
plane of the orbit is inclined at an angle i to the line of sight connecting the 
observer to the focus. From the Doppler shift of the radiation emitted by the 
star 1, it is possible to determine the velocity of that star along the line of sight. 
Show how this information will help one to determine the combination 


m3 sin? i 


This procedure is of importance in determining the mass of an invisible companion 
in a binary stellar system. 


Exercise 1.8 Effect of tides: 


Give simple physical arguments to show that (a) the tide raised on the satellite 
by the planet has the following effects: (i) It de-spins the satellite towards syn- 
chronous rotation; (ii) it acts to damp the satellite’s orbital eccentricity and (iii) 
it can also tidally heat up the satellite. (b) Also show that the tide raised on 
the planet by the satellite has the following effects. (i) It increases the satellite’s 
semi major axis if wrot > Wrey and Wrey > 0 where wrot is the angular velocity of 
rotation of the planet and wrey is the angular velocity of the satellite around the 
planet. If wrot < Wrey (as in the case of the satellite Phobos of Mars), the semi 
major axis will decrease and the satellite can eventually crash on to the planet. 
(ii) It decreases the planet’s spin rate if wrot < Wrey and Wrey > 0. 


Exercise 1.9 Standard error in textbooks: 


Some textbooks give the following argument as regards the Schwarzschild radius. 
“The escape velocity from a body of mass M is vesc = \/2GM/R. If the radius 
of the body is less than 2GM Jè, then vesc > c and not even light can escape 
from the body.” Even assuming that Newtonian considerations are admissible, 
explain why this argument is completely wrong. 


Exercise 1.10 Seeing the Oort cloud: 


Can the comets in Oort cloud lens the background stars in our galaxy? Give 
reasons for your conclusion; look up the necessary parameters of the Oort cloud. 
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Exercise 1.11 Bounds on the mass of the dark matter particle: 


In the conventional cosmological models (which we will discuss in chapter 6) one 
invokes dark matter made of weakly interacting massive particles which form 
large halos around galaxies and in clusters. For the sake of this problem, assume 
that the dark matter is made of massive neutrinos (this is not true!). During 
the early epochs of the evolution of the universe, the maximum phase space 
density of neutrinos will be (dN, /d°zd°p) max % 2/(27h)*. (Explain why.) During 
the collisionless evolution and clustering, eventually leading to a gravitationally 
bound structure of mass M and radius R, regions of high phase space density will 
be mixed with regions of low phase space density. In this process, the maximum 
phase space density can only decrease. Therefore the mean final phase space 
density has to be less than the original maximum value. Show that this allows 
one to put a constraint on the mass of the neutrino as 


—1/4 R —1/2 
> poe 05S, 
my Z 30 eV (sm = =) (z =) (1.99) 





where ø is the velocity dispersion of the dark matter halo. 


Notes and References 


More material on gravitation, directly relevant to astrophysics, can be found in 
chapters 2 and 10 of Padmanabhan (2000). General relativistic description of 
gravity is discussed in e.g., Landau and Lifshitz (1975a), Misner Thorne and 
Wheeler (1973) and chapter 11 of Padmanabhan (2000). 

1. Section 1.1: (a) More details regarding the discovery of extra solar planets 
can be found, for example, in the website http://cfa-www.harvard.edu/planets. 
(b) A detailed discussion of reduced three body problem can be found in Henon 
(1993) and in Marchal (1990). 

2. Section 1.2: The shape of a rotating fluid mass and related details can be 
found, for example, in Chandrasekhar (1969) and Binney and Tremaine (1987). 

3. Sections 1.3 and 1.4: For a detailed discussion of statistical mechanics of 
gravitating systems, see Padmanabhan (1990). Many of these topics are also 
covered in Binney and Tremaine (1987). The process of violent relaxation was 
discovered by Lynden-Bell. See Lynden-Bell (1967). 

4. Section 1.5: For a discussion of this particular approach to deriving 
Schwarzschild metric, see Visser (2003) and references cited therein. 


Chapter 2 


Radiative Processes 


Electromagnetic radiation is the single most important diagnostic tool in 
astrophysics. In any given astrophysical context, you need to relate the 
properties of the radiation received on earth to the properties of the source 
and identify the signatures of the processes which are involved. So we have 
to familiarize ourselves with some of the key radiative processes, which is 
the task we address in this chapter. 

Broadly speaking, one can divide the radiation from astrophysical 
sources into line and continuum radiation. Line radiation arises when the 
source makes a transition between two discrete energy levels thereby pro- 
ducing a distinct spectral line. This process is usually quantum mechanical 
in nature. The continuum radiation, on the other hand, can arise in both 
classical and quantum mechanical contexts. Classically, different kinds of 
accelerating mechanisms for the charged particle will lead to different radi- 
ation patterns. Quantum mechanically, it is possible for a system to make a 
transition from a discrete to continuum energy level (like when an electron 
is kicked off a hydrogen atom) thereby leading to continuum radiation. All 
these processes play their role in astrophysics. 


2.1 The origin of radiation 


We will first describe the origin of radiation in classical and quantum the- 
ories and then discuss the specific processes. 


2.1.1 Radiation in classical theory 
The Coloumb field of a charged particle varies as E œ r~?. The energy 


density of this field, which scales as E? œ r~‘, falls fairly rapidly with 
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distance. Contrast this with the radiation field produced by an accelerated 
charge for which E œ (1/r) and energy density varying as r72. How does 
this come about (and the caterpillar becomes the butterfly, as Feynman 
calls it) just because the charge is accelerated? 

To understand this, let us begin by noting that a charge q moving 
with uniform velocity v is equivalent to a current j = qv. This current 
will now produce a magnetic field (in addition to the electric field) which 
scales in proportion to j. Uniform velocities, of course, cannot produce any 
qualitatively different phenomena compared to those due to a charge at rest, 
because you could always view it from another Lorentz frame in which the 
charge is at rest. The situation becomes interesting when we consider a 
charge moving with an acceleration. If a = v # 0, it will produce a non 
zero (Oj/Ot) and hence a non zero (OB/Odt). Through Faraday’s law, the 
(OB/0t) will induce an electric field which scales as (Oj/At). (That is, if 
(0j/Ot) changes by factor 2, the electric field will change by factor 2.) It 
follows that an accelerated charge will produce an electric field which is 
linear in (0j/Ot) = qa. [This, of course, is in addition to the usual Coulomb 
term which is independent of a and falls as r~?.] Let us consider this electric 
field in the instantaneous rest frame of the charge. It has to be constructed 
from qg,a,c and r and has to be linear in ga. Hence it must have the general 
form 





e-c@ 2-00 (S)(aia) 


where C is a dimensionless factor, depending only on the angle 0 between 
r and a, and n and m need to be determined. (Since v = 0 in the in- 
stantaneous rest frame, the field cannot depend on the velocity.) From 
dimensional analysis, it immediately follows that n = 2,m = 1. Hence 


E=C(6) > (2.2) 


Thus, dimensional analysis plus the fact that E must be linear in q and a, 
implies the r~! dependence for the radiation term. 

There are several ways of determining the factor C(@). The most rig- 
orous method is to consider the field of a charged particle which has just 
begun to accelerate. We will, however, use a more heuristic approach. 
At very large distances from the charged particle, the electric field is 
E = Ecou + Eraa in the instantaneous global rest frame of the charged 
particle. Here Ecou: is the standard Coulomb field which is along the radial 
direction and Esaa is the radiation field which has the magnitude given by 


Chapter 2: Radiative Processes 45 


Eq. (2.2). We can determine C(@) if we can determine the direction of 
Eaa. To do this, we only need to note that one of the Maxwell’s equations, 
V: E = 4rọp continues to be valid even in the case of an accelerated charge. 
Hence the flux of the electric field through a spherical surface at a large 
distance should be equal to the total charge enclosed by the surface. Since 
the Coulomb field already satisfies this condition, the total flux of Eaa 
over such a surface must vanish. If this has to happen without violating 
the symmetries of the problem, then E,,g must be normal to the radius 
vector which requires C(6) = sin@. 

Hence the two components of the electric field are given by: (i) the 
radial Coulomb field Ey = E» = (q/r*) and (ii) the transverse radiation 
field (with a, = asin 6): 


na (E) AA) as 


It is also obvious from special relativistic considerations that the accelera- 
tion of a charge located at x’ at time t’ can only affect the behaviour of the 
field at a location x at a time t where t = t’+|x’ — x|/c. Thus to determine 
the field at (t,x), we need to use the state of motion of the particle at a 
retarded time t' = t—|x’ — x|/c. This is the radiation field located in a shell 
at r = ct and is propagating outward with a velocity c. One can express it 
concisely in the vector notation as 


Eyaa (tr) = 4 [ża x (âx a) (2.4) 
c r ret 

where n = (r/r) and the subscript “ret” implies that the expression in 
square brackets should be evaluated at t’ = t—r/c. The full electric field in 
the frame in which the charge is instantaneously at rest, is E = Ecoul + Erad.- 
The velocity dependence of this field can be obtained by making a Lorentz 
transformation. You will recall that, if a primed reference frame moves with 
a velocity v with respect to an unprimed reference frame, then we have: 


,=7(B+< xB) , (=7(B-= xE) (2.5) 


where E] and B, are the components of electric and magnetic fields per- 
pendicular to v; the components Ej, By parallel to v do not change under 
the Lorentz transformation. 

There are two crucial ways in which the radiation field differs from the 
Coulomb like field. First, the radiation field is transverse to the direction 
of propagation in contrast to the Coulomb field, which is radial. Second, 
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it falls only as (1/r) in contrast to the Coulomb field which falls as r~?. It 
is also clear that each spherical shell containing the radiation field moves 
forward with the speed of light c. Since the energy density in the electro- 
magnetic field is U = (E? + B*)/8z, the amount of energy crossing a sphere 
of radius r centered at the charge is 


2 

€ = e(4rr?°) -2- Ead, (2.6) 
Here the factor 2 takes into account an equal contribution from magnetic 
field and the symbol (---) denotes an averaging over the solid angle. Using 
(sin? 0} = 1— (cos? 6) = 1—((z?) /r?) = 2/3 (where the result (2?) /r? = 1/3 
follows from the symmetry among the three axes) we find that the total 
amount of energy radiated per second in all directions by a particle with 
charge q moving with acceleration a is given by 


dE 2g? 25 

ad 30° 
provided the acceleration is measured in the frame in which the particle is 
instantaneously at rest. The rate of energy emission, of course, is indepen- 
dent of the frame used to define it because both dE and dt transform in the 
identical manner under Lorentz transformation in this case. This result in 
Eq. (2.7) is called Larmor’s formula and can be used to understand a host 
of classical electromagnetic phenomena. 

Since d = (gx) is the “dipole” moment related to a charge located at 
position x, this formula shows that the total power radiated is proportional 
to the square of d. In bounded motion, if d varies at frequency w (so that 
d= ~—w?d), then the energy radiated is given by 


dE 2@ a 

— = — — W 

dt 30 
This turns out to be the correct formula for radiation from an oscillat- 
ing dipole. Different physical phenomena are essentially characterized by 
different sources of acceleration in Eq. (2.7) for the charged particle. 


(2.7) 


(2.8) 


2.1.2 Radiation in quantum theory 


Let us next consider the origin of radiation in quantum theory. The ra- 
diation field in quantum theory is described in terms of photons and the 
emission or absorption of photons arises when a system makes a transition 
from one energy level to another. Consider an initial state with an atom, 
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say, in ground state |G) and n photons present. Absorbing a photon, the 
atom makes the transition to the excited state |Z), leaving behind a (n — 1) 
photon state. Let the probability for this process be 


P (\G)|n) > |En- 1)) xn = Qn. (2.9) 


The fact that this absorption probability P is proportional to n seems intu- 
itively acceptable. Consider now the probability P’ for the ‘time reversed’ 
process (|E)|n — 1) > |G)|n)). By principle of microscopic reversibility, we 
expect P’ = P giving P’ x n = Qn. Calling n — 1 = m, we get 


P' (|E)|m) = |G)|m + 1)) = Qn = Q(m + 1). (2.10) 


You now see that P’ is non-zero even for m = 0; clearly, 
P' (|E)|0) — |G)|1)) = Q which gives the probability for a process conven- 
tionally called spontaneous emission. The term Qm gives the corresponding 
probability for stimulated emission. Thus, the fact that absorption proba- 
bilities are proportional to n while emission probabilities are proportional 
to (n + 1) originates most directly from the principle of microscopic re- 
versibility. Between any two levels, we can write the two probabilities as 


Rabs = Qn; Rem = Q(n+ 1). (2.11) 


Historically, however, radiation was described in terms of a quantity 
called intensity before we knew about the existence of photons. To connect 
the two concepts, let us consider the number of photons with momentum 
in the interval (p, p+ d°p) which is given by dN = 2n|[Vd’p/(2rħ)?]. Here 
the factor in the square brackets gives the number of quantum states in the 
phase volume Vdp and the factor 2 takes into account the two spin states 
for each photon. The corresponding energy dE flowing through the volume 
element d*x = dA(cdt) will be dE = hvdN = 2nhv dA{cdt]{d*p/h*]. Using 
|p| = hv/c and writing d°p = p*dpdQ = (h/c)3v*dvdQ where dQ is the 
solid angle in the direction of p, we get 


2hv3 


dE = -3 





n dA dt dQ dv (2.12) 


which suggests the definition of a quantity called intensity, which is the 
energy per unit area per unit time per solid angle per frequency (that is, 
energy per everything), as 

dE 2hv 


=) = 


dAdtdQdv a nw): 2:13) 


48 An Invitation to Astrophysics 


Among other things, this equation shows that: (i) (I, /v?) is Lorentz invari- 
ant, being proportional to the number of photons per phase cell; often, this 
provides a quick way to transform the intensity from one frame to another. 
(ii) J, measured from a source in the sky is independent of the distance to 
the source if no photons are lost in the way; in fact J, does not change even 
on passing through any lossless optical device (like, for example, a lens). 

While using intensity, it was conventional to redefine the rates of ab- 
sorption and emission in Eq. (2.11) by 


Raæs = Bly; Rem = A+B, (2.14) 


where the coefficients A,B are called Einstein coefficients. Using I, = 
(2hv3/c?)n, and comparing with Eq. (2.11) we find that 


2hv? A 2hv3 


In the old fashioned language based on intensity, one feels somewhat mysti- 
fied as to why the ratio of A/B has this strange value; of course, if you think 
in terms of photons, the corresponding ratio in Eq. (2.11) is just unity. 

To gain a feel for the relative importance of the spontaneous emission 
(governed by A) and stimulated emission, governed by B, let us compute the 
intensity I, of radiation at which the stimulated and spontaneous emission 
rates are equal; BJ, = A. The energy flux contained in a small frequency 
interval dw in such a beam will be I,dw = (A/B)dw = (hw? /2n*c*)dw. For 
visible light, with w ~ 3 x 101° Hz and dw = 2r x 10! Hz, we get Ipdw ~ 10 
Watt cm~?. Conventional light sources do not have such high intensities 
while a laser beam does. 

The intensity defined in Eq. (2.13) has a dQ indicating that it is per unit 
solid angle where the solid angle is measured in steradians. The definition of 
1 steradian is just 1 square radian which is equal to (360/27)? = 3.28 x 10 
square degree. This leads to the conversions 





1 st = 4.25 x 107° arc s?; 1 sq deg = 3 x 1074 st. (2.16) 


The solid angle in entire sphere is 4r steradians which corresponds to 4.12 x 
10* deg?. 
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2.2 Thermal radiation 


As an immediate consequence of the result in Eq. (2.11) we can determine 
the the energy distribution of photons in equilibrium with matter at tem- 
perature T. In such a situation, photons will be continuously absorbed and 
emitted by matter. Consider the rate of absorption (or emission) of pho- 
tons between any two levels, say, a ground state |G) and an excited state 
|E). We saw in Eq. (2.11) that the absorption rate per atom is given by 
Qn while the emission rate is Q(n + 1) where n is the number of photons 
present and Q is to determined from the quantum theory of radiation. In 
steady state, the number of upward and downward transitions must match, 
which requires NcQn = NgQ(n + 1) where No, Ne are the number of 
atoms in ground state and excited state, respectively. Solving for n we get 
n = [(Nc/Ng) — 1)7}. Since matter is in thermal equilibrium at temper- 
ature T, we must also have (Nz/Na) = exp(—AE/kgT) where AE is the 
energy difference between the two levels. This relation should hold for all 
forms of matter with arbitrary energy levels; hence we can take the ground 
state energy to be zero (i.e, arbitrarily small) and the excited state energy 
to be E, leading to 


1 1 


"= (Ng/Na) 21 ene a 


This equation gives the number of photons with energy E = hy [or — 
equivalently — with momentum p = (£/c)| in thermal equilibrium with 
matter. The intensity of this thermal radiation obtained from Eq. (2.13) is 


dE 2hv3 2hv? 1 


dAdtdQdv if c2 
This is known as Planck spectrum. For the sake of practical applications, 
we will give the numerical results as well. The numerical value of B, is 


r3 


e? — 1 
(2.19) 
and z = hy/kgT. From Eq. (2.13) it is clear that integrating I, over 
dQ and dividing by c gives the energy density in a frequency band U, = 
dE/dA (cdt)dv. For isotropic radiation, the integration over dQ is same 
as multiplying by 47 so that U, = (4r/c)I, which becomes, for thermal 
spectrum, U, = (4a/c)B,. Integrating this energy density over all v, we 





3 
B, =8.7x 107? (=z) f(x) erg em~? s-! Hz™' st“; f(z) = 
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get the total energy density of blackbody radiation to be 


5 3 
fo (=) koT = aT ~ ( 


on -3 
Ta erg cm”. (2.20) 


T 4 
3400 z) 

Numerical work with these quantities can be confusing both because of 
the steradian factor and because of the frequency dependence. Your answer 
will change if you use B,, defined with respect to w = 2rv by a factor 27; 
and will change quite a bit more if you use wavelength rather than frequency 
and work with B}. One sensible way of handling this (which, unfortunately, 
has not yet been universally adopted), is to work with vB, which gives the 
intensity per logarithmic band in frequency. Quite obviously, vB, is the 
same as wB,, and more importantly, it is the same as AB). In logarithmic 
band, we have the numerical results: 


T \' -3 
vU, = (zx) g(x) erg cm™?; g(x) = — —— 


tT eH oie 
vB, =1.8 (sox) g(x) W m™ st. (2.21) 
The quantity vB, reaches a maximum value around hy = 4kgT which 
translates to the fact that a blackbody at 6000 K will have the maximum 


for vB, at 6000 A. The maximum intensity is 


4 
(v By) max = 1.3 (sox) Watt m`? ster~! 


= 3 x 107'°TS erg cm~? s7! arcsec”. (2.22) 


Finally let us consider the amount of radiation which is leaking out of a 
small area on the surface of a blackbody (say, a star). If the energy density 
is U [erg cm~*}, then the flux [erg cm~? s7!] is given by (dE/dtdA) = 
(1/4)cU; a factor (1/2) arises from counting only the flux which is going 
outward and the other factor (1/2) comes from averaging cos@ to get the 
normal component of the flux in the forward hemisphere. It is usual to 
write this result as (dE/dtdA) = oT* where 


ac 275 kb -8 a 
Using the relation B = (c/4r)U, we find that B = (ca/4m)T* = oT4t/r 


so that F = 7B (also see exercise 2.2). It is clear from (2.17) that there 
are very few photons with momentum greater than j ~ (kgT/c), so that, 
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(N/V) = (41/3)(p/2mh)? ~ (kpT/hc)® and the mean energy is Ugr © 
kpT(N/V) ~ (kpT)*/(fic)3. 

The expression for n (w) = (e8 — 1)-!, with @ = (kgT)~1, has sim- 
ple asymptotic forms for hw < kgT and hw > kgT. The hw < kgT 
limit corresponds to the long wavelength, classical (fi — 0), regime of the 
radiation. Equipartition of energy suggests that each mode (having two 
polarization states) should have energy ew = 2 x (kgT/2) = (kgT) or 
Nw = (€y/hw) = (kpT/hw). The quantity n(w) does reduce to (kgT/hw) 
in this limit. At these low frequencies, the intensity of thermal radiation 
given by Eq. (2.18) will be B, ~ (2kgT/A?). Because of this relation, it is 
conventional to define a brightness temperature for any source with inten- 
sity I, as Tg = (A7I,,/2kg) which is (in general) a function of frequency. 


The iw > kpT limit, on the other hand, corresponds to the regime 
in which photons behave as particles. In such a case we expect ny = 
exp (—fw/kgT) based on Boltzmann statistics. In this limit, nuy << 1 and 
quantum statistical effects are ignorable; hence we get Boltzmann statistics 
rather than Bose-Einstein statistics. The limiting domains lw < kpT 
and hw >> kgT are called the Rayleigh-Jeans regime and Wien’s regime 
respectively. 

From our general results Eqs. (1.49) and (1.50), we can also compute 
the fluctuations in the energy of the photons: 


JE _ 102 


2 
-35 = za (za) = (E?) — E? = (AE)’. (2.24) 


Z 0p 


In the case of a photon gas with FE = hw (ef — Dg direct differentiation 


gives 
(An)? = ($2) = 2): + (=) =n? +n. (2.25) 


If photons were to be interpreted as particles, then one would expect 
(An)? ~ n giving the usual Poisson fluctuations of (An/n) ~ n~!/?. For 
this to occur it is necessary that n >> n?; that is, n < 1 which requires 
Bhw > 1. On the other hand, if Bhw < 1 then n > 1 and (An)? ~ n?, 
which characterizes the wave-like fluctuations. Thus, you can think of ther- 
mal photons as particles when w >> kgT and as waves when lw < kpT. 

Thermal radiation arises in astrophysics when a primary source of en- 
ergy is thermalized due to some physical process, the most important ex- 
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ample being stellar radiation. Given the luminosity L and the radius R 
of a star, its effective surface temperature is determined by the equation 
L = (4nR?)(oT4,). For the Sun, this gives Teg œ 5500 K; the Planckian 
radiation corresponding to this temperature will have peak value for vB, 
around à ~ 6500 A. The flux of radiation from such a star, located at a 
distance of 10 pc (where 1 pe œ 3 x 1018 cm), will be about F = 3 x 10710 
W m7’. It is conventional in astronomy to use a unit called the bolo- 
metric magnitude, m, to measure the flux where F and m are related by 
log F S —8 — 0.4(m — 1) when F is measured in W m~?. When the flux 
changes by one order of magnitude, the magnitude changes by 2.5. (We 
will say more about magnitude scales in chapter 4.) A Sun-like star at 10 
pe will have a magnitude of about 4.6. 

Another important example of thermal radiation, which is of cosmolog- 
ical significance, is the cosmic microwave background radiation which is a 
relic from the early hot phase of the universe. It turns out that the major 
component of the background radiation observed in the microwave band 
can be fitted very accurately by a thermal spectrum at a temperature of 
about 2.73 K. It seems reasonable to interpret this radiation as a relic 
arising from the early, hot, phase of the evolving universe. The vB,, for 
this radiation peaks at a wavelength of about 1 mm and has a maximum 
intensity of 7.3 x 10-7 W m`? rad~?. The intensity per square arc second 
of the sky is about 1.7 x 10717 W m7? arcsec~? ~ 23 mag arcsec™?. We 
will discuss the properties of this radiation in detail in Sec. 6.6. 


2.3 Monochromatic plane wave 


Having discussed the production of radiation in the classical and quan- 
tum regimes, let us next consider a few important facets of propagation of 
radiation. The simplest type of classical radiation one could study corre- 
sponds to an electromagnetic wave at distances far away from the source of 
emission — like, for example, the light we receive from cosmic bodies. At 
large distances from the source, the wavefront is approximately planar and 
for simplicity, we will assume that the radiation is monochromatic with a 
frequency w. 

If we take the direction of propagation to be n, then the plane monochro- 
matic wave is characterized by the wave vector k = (w/c)n. The wave vec- 
tor k and the frequency w form the components of a four vector just as x 
and £ form the components of a four vector. (You can get this directly from 
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the fact that the phase (wt — k- x) of a wave should be Lorentz invariant. 
Or, by noting that E = fw and p = hk are the energy and momentum 
in the photon picture.) Since the components of this four vector deter- 
mine both the frequency and direction of propagation of the wave, different 
observers will see the wave to have different frequencies and directions of 
propagation. 

As a specific example, consider two Lorentz frames S and S$’, with S’ 
moving along the positive z-axis of S with velocity v. We will call S’ 
the rest frame and S the lab frame and denote quantities in these frames 
by subscripts R and L. (You might think that we got the nomenclature 
switched; usually some object will be moving as seen in the lab frame and 
if we go go to a frame which is moving with the object, it is conventional to 
call it the “rest” frame!) To bring out the relativistic effects as dramatically 
as possible, we will assume that v is close to c so that 


vy ewe) 1 
(1 A ~ (l+u/e) ~ 27 (728) 
with y= (1- v2 /c2) 1? >1. 


Let us look at a plane wave with frequency wy traveling along the direc- 
tion (82, ¢z) in the lab frame. The four-vector k? describing the wave will 
have components (wz/c,kz) in the lab frame and (wr/c, kpr) in the rest 
frame. For motion along xr—axis, only the angle made by k with z-axis 
will change; the azimuthal angle is clearly invariant due to the symmetry. 
Writing ck? = wy cos6,,ck% = wpcosĝôpr and using the Lorentz transfor- 
mations for the four vector kt, we find that 


WR = ywr[1l — (v/c) cos Oz); (2.27) 

wrcosôr = ywrlcos Oz, — (v/o). (2.28) 

These equations relate the directions of propagation and the frequencies in 
the two frames. 


Using the Eqs. (2.27) and (2.28), we can express upr = cosp in terms 
of ur = cosĝ; : 


ur — (v/c) 
= =. 2.29 
R= 1 pA ee 
Figure 2.1 shows a plot of wr against uz for two values of v/c. When 
uL = 1 we have upr = 1 and when py, = —1 we have ur = —1; that is, 


waves traveling along the z—axis appear to do so in both the frames. (The 
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Hp = cos Ôk 





t = cos 6 


Fig. 2.1 Relation between cos@p and cos@z, for two values of v/c. The directions of 
two waves in the rest frame and the lab frame are indicated by thin and thick arrows 
respectively. 


directions of two waves in the rest frame and the lab frame are indicated 
by thin and thick arrows respectively, inside the boxes in the figure) The 
frequencies, however, are different in the two cases. When uz = 1, wR = 
yw (1 —(v/c)), so that wr < wz; the radiation is blue shifted as seen in 
the lab frame. For uz = —1, wr = ywz (1 + (v/c)) and the radiation is red 
shifted. 

When uz = (v/c), we have ur = 0. Since v is close to c, we expect 
uL = cos 0, ~ (1 — 62/2) with 6, <1. Comparing with Eq. (2.26) we find 
that 0, ~ y7! < 1. Hence this direction of propagation is almost along 
the positive x—axis in the lab frame. But since wr = 0, it is clear that 
Ör = 7/2 and the wave was propagating perpendicular to the x-axis in 
the rest frame. Thus, a wave emitted in the direction perpendicular to the 
direction of motion will be ‘turned around’ to (almost) forward direction by 
the relativistic motion of the source. Similarly, when ug = 0, ur = —(v/c). 
In this case, a wave which has been traveling almost along the negative 
x—axis has been “turned around” to travel orthogonal to the z—axis. It is 
clear from the arrows in the little boxes in Fig. 2.1 that the wave will appear 
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to propagate forward in both frames only if 8z < y~!. This phenomenon 
is called relativistic beaming. 

In the two cases discussed above, the frequency of the wave changes. 
When uz = (v/c), wr = wyy! and the wave is still blue shifted. When 
uL = 0,wR = ywz and the wave is red shifted. To find the angle of 
propagation at which there is no frequency change, we can set wL = WR 
and solve for u. This gives ĝe ~ y~!/2. For 0 < ĝe wave is blue shifted. 
A wave propagating along 0 = 0, appears to make the “same” angle with 
respect to x—axis in both the frames. 

The above discussion shows that the motion of a source “drags” the 
wave forward. Waves originally traveling in the forward hemisphere in the 
rest frame (|92| < 7/2) are beamed into a small cone of opening angle y~! 
around the forward direction. Further almost all the energy flowing out 
(except those in small cone of opening angle y~! around the backward di- 
rection in the rest frame) are flowing into the forward hemisphere in the lab 
frame. Hence a charged particle, moving relativistically, will “beam” most 
of the radiation it emits in the forward direction. In exactly same manner, 
you can show that a charged particle moving relativistically through an 
isotropic bath of radiation will see most of the radiation as hitting it in the 
front. These results will be used in the study of several radiative processes 
later on. 

So far we discussed the propagation of the wave in the vacuum. This is 
also a good place to introduce some aspects related to propagation of radia- 
tion through matter. In general, this can be described (phenomenologically) 
using some general parameters called emissivity, opacity etc.. Once these 
quantities are obtained from fundamental physics, the actual application 
to an astrophysical phenomenon proceeds along the following lines. 

If the cross section for some process under consideration (which scatters 
or absorbs radiation) is given by ø, and the number density of scatterers 
is n, then the mean free path of a photon is 1 = (no)~1. In astronomy, 
however, it is conventional to work with a quantity « called opacity (with 
units cm? g~! in c.g.s. units) which is defined as: 

no 
K F (2.30) 
where p is the mass density of the scatterers. (The opacity which will play 
a key role in later chapters.) The optical depth of a system of size R is 
defined to be 7 = (R/l) = a R where a = no. From the standard theory 
of random walk, we know that a photon will traverse a distance R with N, 
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collisions where R = Ne / 71; that is, the number of collisions is given by 


No = (R/U? = 7? provided T > 1. (When 7 < 1, we can think of 7 as the 
probability for the process to take place over a distance R.) In general o, a 
etc will all depend on the frequency and we will indicate it as o,,a, etc. 
with a subscript v when required. 

To give a flavour of the formalism based on @,, let us consider a sim- 
ple representation of the propagation of radiation in a medium which is 
emitting and absorbing radiation. We shall ignore all the scattering effects 
and also any spatial dependence of the processes. Then, when the radia- 
tion travels a distance dl, the intensity J, will increase by the amount j_dl 
due to emission of photons where j, is called emissivity of the medium. 
The absorption of the photons will cause the intensity to decrease by the 
net amount (—a,J,dl) during the same time. Hence the rate of change of 
intensity can be described by a differential equation of the form 


dl, 
dl 


if scattering is ignored. This has the solution 


=j,-w, (2.31) 


IL, = 1(0)e~! + [1 -e 4]. (2.32) 
v 

which satisfies the boundary condition J, = 7„(0) at 1 = 0. So we are 
considering a situation in which some radiation of strength I,,(0) is incident 
from the left on a slab of material which extends from l = 0 tol = R, 
say. The slab of material is characterised by an emissivity j, and opacity 
Ky = a,/p. If the path length of the light through the source is R, then 
the emerging intensity at | = R has two limiting forms: 


I, (R) = L(0)e~ee Ph & [1 — 


Qy 


ew) ice +jR (if T=a,R <1) 
(jv /@v) (if 7 =a,R>1). 

(2.33) 

Equation (2.33) shows several interesting features. Let us first consider 
the case of J,(0) = 0 so that there is no incident radiation and all the 
emission is from the slab itself. Since a7! is the mean free path for the 
process (governed by the cross section o,) the condition tr, = œR < 1 
implies that the mean free path is much larger than the source size and 
the absorption is not very effective. (The source is then said to be optically 
thin.) In that case, the limiting form in Eq. (2.33) with I, (0) = 0 gives 
I, = jy R and we directly observe the source emissivity j, unhindered by 
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absorption processes. In the other extreme limit, when a,R > 1, (called 
optically thick), the observed intensity is the ratio between the emissivity 
and the absorption coefficient of the source. If the emission arises from 
matter in thermal equilibrium, then this ratio is fixed and must be equal 
to the intensity of the black body spectrum, B,, completely independent of 
the details of the process. (An optically thick medium in thermal equilib- 
rium must always emit the blackbody spectrum irrespective of the specific 
process involved in the emissivity j,.) We thus find the universal relation 


jv = Bva (2.34) 


between the absorption and emission characteristic at a given frequency v 
for material in thermal equilibrium. For example, if we know the emission 
rate at a given frequency, we can determine the absorption rate — a feature 
which we will have occasion to use repeatedly. In thermal equilibrium, 
Eq. (2.33) with I,(0) = 0 gives I, = B,[1 — exp(—7,)] so that I, is always 
less than B,. No material system in thermal equilibrium can emit at a 
given frequency v more intensity than a blackbody can emit at the same 
frequency. Further, if B, can be approximated by the Rayleigh-Jeans Limit 
— as often happens in radio astronomy — then the same relation holds for 
the brightness temperature: T = Tsource{1 — exp(—7_)]. 

The situation becomes more interesting when we study Eq. (2.33) with 
I,(0) # 0. Writing the solution as J = I,(0)e~™ + S,(1 — e~™) where 
Ty = œR is the optical depth through the medium, Sy = j,/a, we find that 
the resulting intensity depends on relative values of S, and J,,(0). Consider 
first a situation with S, > I,(0) which could, for example, correspond to 
cool incident radiation passing through a hot gas with a source function S,. 
Then Eq. (2.33) can be rewritten as I, —I,(0) = (S,—I,(0))(1—-e7™). This 
reduces to the limiting form of S, — I, (0) at large optical depths (7, > 1) 
and (S, — JI,(0))7, at small optical depths (r, < 1). When the optical 
depth is small, one sees the continuum background with bright emission 
lines at those frequencies for which S,7, is large. These intensities will 
increase with 7, and eventually saturate at S, at large optical depths. In 
this case, we always have I, < Sp. 

The other limit of S, < [,(0) corresponds to hot radiation propagating 
through cool gas which, for example, occurs in the case of Sun. Now I, — 
Sy = UL(0) — SLje~™ showing S, < I, < I,(0). In this case, we get 
absorption lines which dips below the continuum background of I,,(0) at 
small optical depths; as the optical depth increases, the absorption becomes 
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more prominent and in the saturated case, we detect the intensity S, due 
to the foreground. 

The opacity of matter to the flow of radiation also measures the amount 
of radiation that is absorbed by matter. When this happens, the radiation 
will exert a pressure on matter depending on the opacity. To obtain an 
expression for the radiative pressure, we note that if the flux of radiation 
energy is F the corresponding momentum flux is F/c. Since the momentum 
absorbed by the matter per second is the force, which should also be equal 
to the pressure gradient of the radiation, we get 


F _ PRY _ 4 3 
() no =F (=) = -VPR = -3aT°VT (2.35) 


where the first relation comes from Eq. (2.30) and the last relation comes 
from Pr = (1/3)aT* for the radiation. (The sign shows that the flux is 
from the high temperature region to the low temperature region.) This 
allows one to relate the temperature gradient in matter to the radiation 
flux 


3 pK 1 
VP =-7 785 (2.36) 
which will be of use in the study of stellar structure (see chapter 4). 

As a simple application of the radiation pressure arising from the opac- 
ity, consider the force exerted by the radiation emitted by a body of lumi- 
nosity L on matter around it. Since F = (L/4mr?) at a distance r from the 
body, matter will be pushed out if the radiative force exceeds the gravita- 
tional attraction. The condition for this not to happen (so that matter can 
accrete onto the central mass) is 








L Kp GM 
A4nr2 ee r2 1) 
which translates to the condition on luminosity 
4 
L<Lgp= ( zz) M. (2.38) 


This quantity, Lp, is called Eddington luminosity and is the maximum pos- 
sible luminosity that can be achieved in simple accretion process. (Strictly 
speaking, astronomers define Lg with « arising due to scattering by free 
electrons in the medium; but, of course, the above argument is valid for 
any K.) It plays a crucial role in the study of quasars and active galactic 
nuclei. 
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2.4 Astrophysical radiative processes 


We shall now apply these considerations to some specific astrophysical pro- 
cesses in which radiation is emitted due to different acceleration processes. 
Among the classical (or nearly classical) radiation processes, the most im- 
portant ones are thermal bremsstrahlung, synchrotron radiation and inverse 
Compton scattering with which we will begin our discussion. 


2.4.1 Thermal bremsstrahlung 


As a first example, let us study the radiation emitted by a fully ionised 
hydrogen plasma at temperature T, when the electrons are constantly scat- 
tered by the protons. In a scattering between an electron of mass Me and a 
proton, with an impact parameter 6 and relative velocity v the acceleration 
of the electron is a ~ (q?/m-_b?) and lasts for a time (b/v). Using Eq. (2.7) 
we can estimate the total energy emitted in such an encounter to be: 


E x (q’a"/c3)(b/v) ~ (q°/c3m2b3v) & (q6n;i/c2m?2v) (2.39) 


since b  n, 1/3 on the average. (This ignores distant encounters and a more 


precise calculation will require integrating over all impact factors leading to 
a logarithmic correction, as in Eq. (1.64). This is, however, adequate for our 
purpose.) The factor m? in the denominator explains why we can ignore 
the radiation emitted by the ions in the scattering, since it is sub-dominant 
by a factor (Mme/Mp)?. We have also ignored the scattering of electrons 
by electrons and ions by ions. The reason for concentrating on electron-ion 
collisions is the following: For a system of non relativistic charges with same 
charge-to-mass ratio, (q/m), the second derivative of the dipole moment 


es Š qi a q . 
d= So aki = 5 (=) MiXi = Pa X Pi, (2.40) 
vanishes due to momentum conservation. Hence the dipole radiation from 
electron-electron collision and ion-ion collision will be sub dominant to the 
effects from electron-ion collisions as long as the system is non relativistic. 
(You can’t write p; = m;X; for relativistic particle; we will not, however, 
discuss relativistic plasmas.) 

The total energy radiated per unit volume will be n.€. Since each colli- 
sion lasts for a time (b/v), there will be very little radiation at frequencies 
greater than (v/b). For w < (v/b), we may take the energy emitted per unit 
frequency interval to be nearly constant. Further, in the case of plasma in 
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thermal equilibrium, v = (kgT/me)!/?. Putting all these together, we get 





dE Mec? nee 
I= (žr) = (mec*)rg (Es) neni = Kn?T-/?; (hw S kT) 
: (2.41) 


where ro = q?/mec? is called the classical electron radius and K = 
(mec?)r3(mec?/kg)'/? is just a constant. This process is called thermal 
bremsstrahlung. The bremsstrahlung spectrum is nearly flat for 0 < w < 
(kpT/h) and will fall rapidly for w > (kgT/h), where the upper limit comes 
from the fact that an electron with a typical energy of (kgT) cannot emit 
photons with energy higher than (kgT/h). A more precise expression is 


given by 


jv = Kg(v,T, Z)Z?nen;T ~/? exp (-2) (2.42) 
kgT 
where g is called the Gaunt factor and Z? takes care of the ionic charge 
being Zq in case the gas is not hydrogen. 
The total energy radiated, over all frequencies, from such a plasma can 
be found by integrating Eq. (2.41) over w in the range (0, kBT/ħ). This 
gives 








(kBT/ħ) 1/2 
"a p J dw 2i > (m c*)r3 mec? ket. Neni 
dtdV j} dwdtdV ]~ >` © lO A (mee ne 
0 
= 1.4 x 1077'n2T!/? erg s-tem73, (2.43) 


The bremsstrahlung radiation emitted by the plasma takes away the energy 
of the system and thus can be an effective cooling process at high tempera- 
tures. The characteristic time scale for cooling of a thermal plasma is given 
by the the ratio between the thermal energy density nekgT and the energy 
lost per second per unit volume of the plasma obtained above. This gives 


_ _ nekpT 1 Sp fRBTN P a 1/2 
teool = TIE /ataV) ~ (z). (= xne T (2.44) 





where a = (q?/ħc) is the fine-structure constant. Putting in numbers, we 
get 


nekBT T aid Ne —1 
t e A n D S — ; , 
HEEZE 10-27T1/2n2 á (sex) on E Gyt. (2.20) 
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Thermal bremsstrahlung occurs in several astrophysical systems lead- 
ing to radiation in completely different wave-bands. One application of 
our result is in the case of regions around hot stars which are completely 
ionized by the UV photons from the star. Such a local island of plasma 
in the interstellar medium (ISM) is called a HII region.! They exist in the 
spiral arms of our galaxy and emit thermal bremsstrahlung radiation with 
a relatively flat spectrum in the radio band. For example, the HII regions 
in Orion have T œ 104 K, n; ~ 2 x 103 cm 73 and an effective line of sight 
thickness of 6 x 1074 pc. From Eq. (2.41) we can estimate the flux to be 
about 3 x 1072! erg m~? s~! Hz! = 300 Jy where 1 Jy = 10-26 W m~? 
Hz~1. This is typical of emission from HII regions. 

Another prominent source of bremsstrahlung radiation is the hot gas 
in clusters of galaxies. Taking the cluster gas to be fully ionized hydrogen 
with a mass of about Mgas = 1048 gm spread over a sphere of radius R ~ 3 
Mpc, one can estimate the mean number density of ions and electrons as 
Ne © (Mgas/MpV) % 1074 cm~? where V is the volume of the cluster. The 
virial temperature of the gas is kgT ~ (GMm,/R) giving T ~ 108 K. Using 
Eg. (2.43) and the estimated values of ne and T, we get the total luminosity 
from the whole cluster to be £ = LV = 5 x 1044 erg s~!. This radiation 
will be in the wave-band corresponding to the temperature of 108 K which 
is in x-rays. Several such x-ray emitting clusters have been observed. In 
fact, the gas mass within about 500 kpc of the center of the cluster is fairly 
tightly correlated with the x-ray luminosity: Mgas(< 500 kpc)/1014Mo % 
(0.3L,,/10* erg s~'). 

Using Eq. (2.45) we can also estimate the cooling time for the X-ray 
emitting gas in such a cluster. While the mean number density of electrons 
computed above is about 1074 cm~3, the central densities can be higher by 
a factor of few hundreds giving ne ~ 0.01 cm~? which is used in Eq. (2.45). 
This will make central regions of the cluster to cool rapidly (within a life- 
time of a few Gyr) thereby reducing the central pressure support. This, 
in turn, could cause flow of gas from the outer regions towards the cen- 
ter. Investigation related to such cooling flows (including the question of 
whether they really exist!) has been a major area of research in the study 
of clusters. 

1This is a good place to let you know of one of the many crazy notations used by 
astronomers. The symbol HI stands for neutral hydrogen while HII stands for singly 
ionized hydrogen — that is, proton. Similarly CIV, for example, will stand for triply 
ionized carbon and CI will represent neutral carbon atom — which, mercifully, is not 


often used. In general, if you see (element symbol) (Roman numeral for n) you should 
take it to mean the element with (n-1) electrons removed. 
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We can use our results to study another important process called the 
free-free absorption which is a key source of opacity in plasma. This is the 
‘time reversed’ process of bremsstrahlung, in which a photon is absorbed 
by an electron while in the proximity of an ion. In equilibrium, the rate 
for this free-free absorption should match that of thermal bremsstrahlung. 
We write the free-free absorption rate as nog B(v) where og is the free- 
free absorption cross section and B(v) œ v3(e/FeT — 1)-! œ v?T (when 
hv < kpT which is usually the relevant range in the astrophysical contexts) 
is the intensity of thermal radiation. Equating this to the bremsstrahlung 
emissivity found in Eq. (2.41), jy x n?T~1/2, we get og œ (n/v?T?/?). 
With all the numerical factors added, this result corresponds to a, = nog 
in the Rayleigh-Jeans regime given by: 





rå (Mec? EP 2 2m—3/2,~2 
z —> =n = = ais 2.4 
we > (Es) an Cn T" ty (2.46) 
This result shows that bremsstrahlung photons emitted by a source can 
be self absorbed at small frequencies at which a œ v~? can be large. 
More specifically, consider a source of size R at a distance d emitting 
bremsstrahlung radiation. Using the result in Eq. (2.32), we find the flux 
to be 


F, = 2kpT (=) 2), (1 — exp [-cT-9/?n2v-?R] ) ; (2.47) 


At low frequencies, the exponential term vanishes and F, œ v?; at high 
frequencies, the exponential can be expanded in a Taylor series and the 
leading term will cancel the v? in the front leading to a constant flux in- 
dependent of v. The turn over frequency is v? = CT~3/?n?R which can 
be determined entirely in terms of source parameters. Such a turnover in 
the spectrum with v? dependence changing over to constant flux has been 
observed in several astrophysical sources. 

In most cases of interest, we can take the typical frequency of the photon 
involved in the process to be proportional to T, so that og «x nT73°5. 
Adding the numerical constants, the corresponding opacity can then be 
written in the form 


kg x pT 3° ~ 10% pT-3* em? gm™t. (2.48) 


This is usually called Kramer’s opacity and plays a key role in stellar mod- 
eling. 
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2.4.2 Synchrotron radiation 


Another major source of acceleration for charged particles is the magnetic 
fields hosted by plasmas. The radiation emitted by this process, called 
synchrotron radiation, can be estimated as follows: Consider an electron 
moving with velocity v in a magnetic field B. To use Eq. (2.7) we need to 
estimate the acceleration in the instantaneous rest frame of the particle. In 
that frame, the magnetic force (¢/c)[v x B] is zero; but the magnetic field 
in the lab frame will lead to an electric field of magnitude E’ = (v/c)7B, 
in the instantaneous rest frame of the charge, inducing an acceleration 
a’ = (qE'/me). Accelerated by this field, the charged particle will radiate 
energy at the rate (which is the same in the rest frame and lab frame) 


dE dé! 2q7, 2 2q gv? 
w= (Gr) = 3500" = samai PR 
Thus the power radiated by an electron of energy € = ymec? scales with the 
energy as (dE/dt) œ v?e? B?. For slow speeds (v < c), the energy € ~ mec” 
is a constant and the power is proportional to the kinetic energy of the 
particle; at relativistic speeds, v/c ~ 1 and the power is proportional to 
the square of the particle energy. Hereafter, we shall consider the extreme 
relativistic case with v/c = 1 and y > 1. 

Since the energy density in the magnetic field is Ug = (B?/87) we can 
also write the above result in a more suggestive form as: 


2 2 
($) c Dr ( Ei ) Up ~ (org) y? (2.50) 





dt 3 (mec? 


where or = (87/3)(q?/mec?)* is called the Thomson scattering cross sec- 
tion [which we will discuss in the next section; see Eq. (2.61)]. The time- 
scale for energy loss by this process can be found by dividing the energy of 
the electron (ymc*) by (dE/dt); this gives 


B €e \7i 
teyne = (ag fat) ~°* 10% [y7 Bases] = 100" yr (o) (ay) i 
(2.51) 
for the extreme relativistic case. (More formally, Eq. (2.50) for a single 
electron gives (de/dt) œ e? which integrates to give e(t) = e(0)(1 + t/r)~' 
where 7 = 3m2c3/4ore(0)Ug. This is essentially the same time scale as in 
Eq. (2.51)). 
Let us next consider the spectrum of synchrotron radiation from a highly 
relativistic charged particle. For a non relativistic particle spiraling in a 
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magnetic field, we have (mv?/rL) = qB(v/c) where rz, called the Larmor 
radius, is the the radius of the orbit. The characteristic angular frequency 
is wz = (rz/v) = (qB/mec). For relativistic motion with constant v?, 
the effective mass is me(1 — v?/c?)~!/? = mey so that the corresponding 
angular frequency becomes wz = (¢B/mcy) = (qcB/e) for a particle of 
energy € in a magnetic field field B. Based on this consideration, one 
would have expected the synchrotron radiation to peak at this frequency wz 
but actually the synchrotron radiation from an extreme relativistic particle 
peaks at the frequency 


We Mwy? x By? « Be (2.52) 


where the extra factor y° arises from special relativistic effects. One factor 
of y arises purely from time dilation factor while the factor y? arises from 
the fact that radiation is beamed mostly in the forward direction. This 
latter factor can be understood as follows: From the discussion in Sec. 2.3, 
we know that most of the radiation from a relativistically moving particle 
is emitted in the forward direction into a cone with opening angle 


J212 7 me 
asx (1-5) xy m (2.53) 


An observer will receive the radiation only when the direction of observation 
is within a cone of angular width A0. To relate the time of arrival of 
radiation, t, to the angle 6, let us consider the radiation emitted at two 
instances tı = 0 and tg = t when the charged particle was at positions 
6, = 0 and 62 = 0. Clearly rg@ = v (t2 — tı) = vt giving t = (rp6/v). 
The arrival time of the pulses at the observer will be less by the amount of 
time (rg6/c) taken by the radiation to travel the extra distance (rp@). So 
tobs ~ t — rpO/c = (rg6/v) (1 — v/c). Hence 


Vtob. 1 2vtob. 
obs “gy 23 y2 2 


fao S Tg TZA ~ re 





(2.54) 


This gives an extra factor of y? in Eq. (2.52) which essentially arises from 
the relativistic beaming. The exact spectrum for synchrotron, F (w/w), 
behaves as 
F(w/we) = SU ie) ar 2s) (2.55) 
(w/we)/ (for w K we.) 
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In fact, most of the energy will be emitted at frequencies close to we x Be? 
where € is the energy of the charged particle. It follows that € x wi/*B-1/2, 
Numerically, 


e€ © 4y)/? B1? erg (2.56) 


when all quantities are in c.g.s. units and v = w/2r. Using this relation, 
Eq. (2.51) can be expressed as 


B X) 7 v yo 
tne S 8 ao aoe , 
PETAMI (a) G a) 


This is lifetime (of a synchrotron source emitting with the peak frequency 
vc), over which the charged particles would have lost most of their energy, 
unless other physical processes replenish it. 

Let us now move on from the radiation emitted by a single electron to 
that emitted by a bunch of electrons with a distribution in the energies. If 
n(e)de denotes the number of electrons with energies in the range (e, €+ de) 
then the total radiation emitted by this bunch of electrons will be jọ œ 
(B7e?)(n(e))(de/dw) where the first factor Be? is the energy emitted by a 
single electron, the second factor is the number of electrons with energy e 
and the last factor is the Jacobian (de/dw) x €~! œ w7!/? from e to w. If 
the spectrum of electrons is a power law n(e) = Ce~?, then the radiation 
spectrum will be 


(2.57) 


e3 3e (p—1)/2 
A (5s) C BOT /2 y-@-0)/2, (2.58) 


mec? \A4nmac 


where we have reintroduced all the constants. (A more precise calculation 
multiplies the expression by a p dependent factor which is about 0.1.) This 
leads to a power law spectrum jp x v7® with a = (p — 1)/2 allowing 
us to relate the observed spectral index a to the spectral index p of the 
particles producing the radiation. This is a standard procedure used to 
explain sources which are observed with a power law spectrum. 

Several discrete sources (supernova remnants, radio galaxies, quasars, 
..) emit radio waves essentially through synchrotron process. As a spe- 
cific example, consider the diffuse radio background in our galaxy due to 
synchrotron radiation from electrons in the interstellar medium. Observa- 
tions indicate that the spectrum of the electrons is given by (dN/dE) ~ 
3 x 1071"! E-?-3 particles cm~? if E is measured in GeV. If the magnetic 
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field is about 6 x 1078 Gauss, Eq. (2.58) will predict a volume emissivity of 


Y 


SL 
wu) ergs’! cm? Hz}. (2.59) 


jv ~ 3 x 10738 ( 
This is close to the observed background emission. Over a line-of-sight of 
1 kpc, this will give a flux of 107°? W m~? rad~? Hz~!. Similar power law 
radio spectra are seen in the case of radio galaxies, quasars etc. and are 
thought to be due to synchrotron radiation. 

An interesting application of synchrotron cooling is to the Crab nebula 
which is a supernova remnant that exhibits power law spectrum F, « v~° 
in radio frequencies. The spectrum has a slope a œ% 0.3 for v < 10° GHz 
and a steeper slope of a ~ 0.8 for v > 103 GHz. The flux is about 10° 
Jansky at v = 1 GHz. This observed “break” in the spectral index (i.e. 
the change from a = 0.3 to a = 0.8) can be nicely understood in terms of 
a model in which a power law spectrum of electrons with the distribution 
n(e) x e7? are injected at t = 0 and are continuously losing energy due 
to synchrotron radiation till today (t = T). If the synchrotron loss time 
is tsync, then for tsync > T, we expect the spectrum to retain the original 
shape with n(e) x Te~?. On the other hand, at high energies at which 
tsync < T, the equilibrium between injected energy and radiation loss will 
require n(€) « tsync€”. But since tsync X 7! (see Eq. (2.51)), we find 
that n(e) x e~ +?) at high c, i.e., the electron spectrum steepens by unity. 
Since the spectral index of synchrotron emission is œ = (p — 1/2) (see 
Eq. (2.58)), it steepens by 0.5 which matches with the observations. (While 
this idea is reasonable, it is correct only within an order of magnitude. 
This model predicts that the break frequency va should occur at a point 
where tsync = T. Setting v to the characteristic frequency of synchrotron 
radiation, one can use this relation to estimate the age of the Crab nebula 
and we get T = 2000 years, which is somewhat on the higher side.) 

These calculations assume that the radiation emitted by the source 
reaches us without any modification. In general, some of the radiation 
will be absorbed — especially at low frequencies — thereby modifying the 
spectrum. For thermal bremsstrahlung we found that this effect leads to 
a v? dependence at low frequencies (see Eq. (2.47)). In the case of syn- 
chrotron radiation, absorption makes the low frequency spectrum acquire 
a universal form, with j, œ v°/?. This scaling has a simple physical inter- 
pretation. You will recall that the low frequency thermal radiation has the 
form B, ~ (2v?/c*)kgT in which the first factor (2v2/c?) comes from phase 
space and the second factor (2 x (1/2)kgT) comes from the mean energy of 
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the photons. In the case of a synchrotron source, the phase space factor, of 
course, remains the same but kgT should be replaced by the mean energy 
€ of a synchrotron electron emitting radiation at the frequency v. From 
Eq. (2.56) e x v}/2B-1/? giving 


2u? v \ 
I, x B75 z% (=) (=) Mec? (2.60) 


which explains the (5/2) power. In the last equality we have reintroduced 
the constants of proportionality and expressed the result in terms of the 
Larmor frequency vz. 

Synchrotron radiation will be linearly polarized with a high degree of 
polarization. To see this, consider the radiation emitted by a single electron 
circling in a magnetic field. When the electron is viewed in the orbital plane, 
the radiation is hundred percent linearly polarized with the electric vector 
oscillating perpendicular to the magnetic field. When viewed along the 
direction of the magnetic field, the radiation is circularly polarized. (If it is 
right handed circular polarization when viewed from the top, it will be left 
handed circular polarization when seen from the bottom). Consider now 
the relativistic motion of the electron which beams the radiation within 
a narrow cone in the direction of motion. The radiation is now mostly 
confined to the plane of the orbit and, in any realistic situation, there will 
be a bunch of charged particles with a distribution in the pitch angle. In 
that case, the two components of the circular polarization will effectively 
cancel while the the linear polarization will survive to a significant extent. 
Hence the radiation will be linearly polarized to a high degree. For example, 
the radiation at v ~ 1073, will be about 50 per cent polarized while the 
rare photons with v x 10v. will be almost completely polarized. If the 
emission arises from a power law spectrum of charged particles with index 
p, then the degree of polarization can be computed to be (p+ 1)/(p+ 7/3). 


2.4.3 Inverse Compton scattering 


Another important process by which astrophysical systems generate high 
energy photons is by scattering of photons by highly relativistic electrons. 
In such a process, significant amount of energy can be transfered from the 
electrons to low energy photons thereby converting them into high energy 
photons. We will now see how this comes about. 

The simplest form of scattering of photons by charged particles occurs 
when a charged particle — originally at rest — is accelerated by an elec- 
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tromagnetic wave which is incident on it. Consider a charge q placed on 
an electromagnetic wave of amplitude E. The wave will induce an acceler- 
ation a ~ (qE/m) causing the charge to radiate. The power radiated will 
be P = (2q?a?/3c*) = (2q*/3m?c*) E?. Since the incident power in the 
electromagnetic wave was S = (cE? /4r), the scattering cross section (for 
electrons with m = me) is 


2 2 
panan ( = z) = 6.7 x 107cm? (2.61) 
Mec 
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which is called the Thomson scattering cross section. This cross section 
governs the basic scattering phenomena between charged particles and ra- 
diation. The corresponding mean free path for photons through a plasma 
will be lr = (nesr)! and the Thomson scattering opacity, defined to be 
Kr = (n.or/p), is 


ees (=) (=) = 0.4 em? gm7! (2.62) 


for ionized hydrogen with ne = np. 

Incidentally, one key feature of any scattered electromagnetic radiation 
is that it will be polarized even if the original radiation is not. This feature 
can be understood fairly easily: Consider the electromagnetic radiation 
propagating along the positive z—axis and hitting an electron near the 
origin. If the incident radiation is unpolarized, its electric field vector could 
be anywhere in the x—y plane. Under its influence, the charged particle will 
oscillate in the direction of the electric field vector in the z~y plane. When 
viewed from a direction making angle 0 with the z—axis, only the motion of 
the charge perpendicular to the line of sight will lead to the radiation field 
making the scattered radiation in the direction @ to be partially polarized. 
In fact, the radiation in the direction 0 = 7/2, say along the x-axis, will 
be totally polarized. This is because, when viewed along the r—axis, only 
the component of the motion of the charged particle in the y direction will 
contribute to radiation. (If you look at the sky at a direction 90° away 
from the Sun through a polaroid, you can easily verify that the scattered 
sunlight is totally polarized; it is the same effect.) In star forming regions, 
one can often map the polarization of light scattered by the interstellar 
cloud and verify this effect. 

Let us proceed further and study the case in which a bunch of highly rel- 
ativistic electrons are interacting with a radiation field of photons. We can 
actually use the Thomson scattering to study this case as well by the fol- 
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lowing trick: We first make a Lorentz transformation to the instantaneous 
rest frame of the electron, study the scattering using the result obtained 
above and then transform back to the lab frame. 

In the lab frame, the radiation field may be thought of as a random 
superposition of electromagnetic waves with (E?) = (B?) = 8rU where U 
is the energy density. If a charged particle moves through this field with a 
velocity v, then, in the rest frame of the charge, the electric field will have 
a magnitude E’ ~ yE. Accelerated by this field, the charged particle will 
radiate energy at the rate 


dé e dé! a ga? 
dt dt’ 6 





2\2 
~ (5) cy’ E? ~ apc 7è. (2.63) 
The net effect is to convert a photon at frequency vo to a frequency 7% 
and is quite dramatic if y >> 1. Since y = 10° for a 0.5 GeV electron, this 
process can convert a radio (10° Hz) photon to a UV (10!5 Hz) photon or 
a far infrared (3 x 10!2 Hz) photon into an x-ray (3 x 10!8 Hz) photon. 
You can see why inverse Compton scattering — as it is called — is very 
important in high energy astrophysics. 

The lifetime for inverse Compton loss to significantly reduce the energy 
of the charged particle will be 


ee eee 
~ dE/dt ~~ arcy?U" 


As an example, consider a 100 GeV electron moving through the cosmic 
background radiation, which has a temperature T = 2.73 K and the energy 
density Ucsr = aT? ~ 4 x 10713 erg cm~’. Equation (2.64) shows that 
its life time is r < 10’ yr. If you catch such an electron, it should have 
originated from within a region of size cr ~ 3 Mpc. 

Equation (2.63) is similar in form to Eq. (2.50) and the correspondence 
between the two processes runs deep. Just as in the case of synchrotron 
radiation, here also the spectral form is v—‘?-)/? if the electrons have a 
power law distribution in energy with index p. There are several astro- 
physical sources in which relativistic electrons of a given spectrum will be 
responsible for both synchrotron radiation and inverse Compton scattering 
at different frequencies. The electrons in the presence of a magnetic field 
will produce synchrotron photons (which could be, for example, in radio 
band) which will undergo inverse Compton scattering with the electrons 
thereby producing high energy radiation in another band, say x-rays. In 
general, when a radiation field and magnetic field coexist, the ratio between 


? 


T (2.64) 
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the power radiated by inverse Compton process and synchrotron process 
will be in the ratio f = (Usync/Up) where Usynch is the energy density of 
primary synchrotron photons. 

In fact, one can envisage a situation in which the first generation photons 
produced by inverse Compton process undergoes a further inverse Comp- 
ton scattering to produce second generation of high energy photons etc. 
Obviously this will increase the photon density as (1+ f + f? +...) which 
will converge only for f < 1. This leads to an interesting bound on the 
brightness temperature of a region in which the process takes place. To 
make an order of magnitude estimate of the maximum brightness temper- 
ature possible, let us first note that the flux of synchrotron radiation from 
a source of size R at a distance d can be written in two different forms as: 


wR? — 2kpT, TR? 


Se gr ne ae op? 
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cU, D2 


sync Z (2.65) 
where we have evaluated the synchrotron flux as vI, at the critical fre- 
quency at which the flux is maximum and has also introduced the bright- 
ness temperature through the relation I = 2kgT/X?. This gives us the 
energy density of synchrotron photons in terms of the brightness tempera- 
ture to be Usyne ~ (2713 /c*)kgT}. On the other hand, the energy density 
of magnetic field is Ug = B?/8m. We can first eliminate B in terms of 
ve using Eq. (2.52), ve = (q¢B/2nmec)7" and express y in terms of the 
brightness temperature by ymec? ~ 3kgTy. This gives the energy den- 
sity of the magnetic field in terms of the brightness temperature to be 
Up = (wm&c!°/162q7k4)(v2/TA). Thus we find that 


Usyne  (324q7k%, E 
f = Ug = es Vel, . (2.66) 


The condition for instability f > 1 translates numerically to 


Th 2 Ve 
—= == l. . 
Fu z| (sae a) > 20 
It follows that if the brightness temperature greatly exceeds 10}? K at radio 


frequencies, the inverse Compton losses will be very high leading to a bound 
of T, < 10}? K for any radio source. The same result can be expressed as 


y\ 1/5 
kpT, > (=) Mec? (2.68) 
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where A, is the wavelength at maximum frequency and rọ is the classical 
electron radius. 

The inverse Compton scattering obtained above, of course, is only a part 
of the story. When a radiation field is interacting with charged particles, 
there will be energy transfer from charged particles to photons as well as 
from photons to charged particles. The net effect, of course, will be to drive 
the system to thermal equilibrium with equipartition of energies. So far we 
only looked at the transfer of energy from charged particles to photons. 
To obtain the full effect we need to consider both processes which can be 
handled by the following argument. 

Let the energy transferred from the photons to the electrons be AE 
on the average. (We shall omit the symbol ( ) for simplicity of notation, 
even though we are dealing with average values.) When E < mc? (where 
E is the typical energy of the photon) and kgT < mc’, we can expand 
(AE/me?) in a double Taylor series in (E/mc?) and (kgT/mc’), retaining 
up to quadratic order: 


ar +c Z +c kar + EY" 
me lone me? 3 (mne A nee 


E \ (kgT kBT\? 


Interestingly enough, the coefficients (c1,--- ,ce) can be fixed by simple 
arguments: (i) Since AE = 0 for T = E = 0, we must have cı = 0. (ii) 
Consider next the scattering of a photon with electrons at rest. This will 
correspond to T = 0 and E Æ 0. If the scattering angle is 0, the wavelength 
of the photon changes by 


A\ = (=) (1 — cos 8). (2.70) 


(This is just the textbook case of Compton scattering.) Such a scattering 
of a photon, by an electron at rest, is symmetric in the forward - backward 
directions and the mean fractional change in the frequency is (Aw/w) = 
—(AX/X) = —(ħw/mc?); so the average energy transfer to the electrons 
is AE = E?/mc?. This implies that cp = 0 and cq = 1. (iii) If E = 0 
and T # 0 photon has zero energy and nothing should happen; hence 
c3 = cs = 0. So our expression reduces to 


AE BAe E kBT\? 
aa (sa) i (=) (=) ar) 
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(iv) To fix the last constant cs, we can consider the following thought 
experiment. Suppose there is a very dilute gas of photons at the same 
temperature as the electrons. Then the number density n (E) of photons 
is given by the Boltzmann limit of the Planck distribution: 


n(E) dE x E? (ôE — 1)" dE œ E? exp (—E/kpT) dE. (2.72) 


In this case, since the temperatures are the same, we expect the net energy 
transfer between the electrons and photons to vanish. (You need to use the 
Wien limit rather than the exact Planck spectrum to ensure that photons 
behave as particles.) That is, we demand 


0= | dE n(E)AE (2.73) 
| 


where n(E) x E*exp(—E/kgT). Substituting for AE and n(£) from 
Eq. (2.71) and Eq. (2.72), one can easily show that (4kgT + cskpT) = 0 


or c5 = —4. Hence we get the final result 
AE (E~—4kpgT) 
ae oe 2) 


One can say that, in a typical collision between an electron and photon, 
the electron energy changes by (E?/mc?) and photon energy changes by 
(4kpT/mc*)E. (To order v/c, the probability of photon getting redshifted 
or blueshifted is equal and AE/E = v/c where v is the velocity of the 
electron. On the average, this gives zero energy transfer to O(u/c). But to 
order O(v?/c) the probability for blueshift is larger than the probability 
for redshift which is why you get AE/E ~ v?/c?.) 

Repeated scattering of photons by hot electrons can lead to a process 
called Comptonisation, which is of importance in some astrophysical sys- 
tems. The mean-free-path of the photon due to Thomson scattering is 
Ay = (neor). If the size of the region l is such that (I/A,) > 1 
then the photon will undergo several collisions in this region; but if 
(l/Ay) S 1 then there will be few collisions. In terms of the optical depth 
Te = (l/r) = (neorl), the Te >> 1 limit implies strong scattering. If 
Te © 1, then the photon goes through N,(> 1) collisions in traveling a 
distance l. From standard random walk arguments, we have N? / Pe ~l 
so that Ns = (l/A,)? = 72. On the other hand, if Te <1, then Ns ~ Te; 
therefore we can estimate the number of scatterings as Ns ~ max(Te, T2). 
The average fractional change in the photon energy, per collision, is given 
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by the second term in Eq. (2.74), viz., 4(kpT./mec’). Hence the condition 
for significant change of energy is 


Ns (=) =4 (=) max (Te, T2) ~1. (2.75) 


Mec? Mec? 








It is conventional to define a parameter y, (called Compton-y parameter) 
by 





y= (23) max (Te, 72) A (2.76) 
Then we can write the condition for significant scattering as y œ 1/4. 

Inverse Compton scattering leads to an interesting physical phenomena 
called Sunyaev-Zeldovich effect which is proving to be of great importance 
in the study of clusters of galaxies. The standard model of the universe, 
which we will study in detail in chapter 6, shows that we are permeated 
by a radiation background (CMBR) with a temperature of about 2.73 K. 
The large clusters of galaxies which contain hot electron gas, at temper- 
ature Te, are embedded in this background radiation thereby leading to 
inverse Compton scattering of the microwave photons by the hot electrons. 
Each scattering will boost the photon energy by the fractional amount 
(kpTe/m-c*) without, of course, creating new photons. Thus the photons 
in the low energy tail of the CMBR blackbody spectrum are moved to 
high energies thereby leading to a spectral distortion in the direction of 
the cluster, with less photons at low frequencies and more photons at high 
frequencies (compared to the Planck spectrum). At long wavelengths this 
will essentially appear as a uniform drop in the temperature given by 


AT y ft, 
P.S eC? 





neor dl (2.77) 


where the integration is along the path of the photon through the hot 
gas. The factor 2 comes from a more detailed analysis but the rest of the 
factors are easy to understand as due to inverse Compton scattering. This 
effect has been measured for several large clusters and acts as a powerful 
diagnostic tool. 

As a simple illustration, consider a cluster containing an intra cluster 
plasma which is distributed with a radial dependence given by 


Ne 


(1+ r?/r2) ae) 


n(r) = 
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where re is a constant. The optical depth through the cluster which appears 
in Eq. (2.77) becomes 


"e Ea = -3 (Ne Te 
To = OT D n(r)dr = 6.4 x 10 (=) (5) ; (2.79) 








We can obtain some additional information about ne and Tre from the x- 
ray emission of the plasma. Using the bremsstrahlung emissivity eg ~ 
1.4 x 10-?7T1/2n? erg s~! cm™? (see Eq. (2.43)) we find that the total 
x-ray luminosity is given by 


Te 
Ly = an f eeg rĉdr 
0 


3 1/2 
= 41 p—1 Nc 2 Te kBTe —1 


where f is a “filling factor” which is a ratio of the x-ray emitting volume 
to the core volume of the cluster. Observations suggest that, for a typical 
cluster, Lx = 1044h~? erg s7! re œ 200h7! kpc and T, œ% 4 keV where 
h is the dimensionless Hubble expansion rate of the universe. Using these 
in Eq. (2.80) we can estimate n, = 5 x 1073h1/2 f-1/2 cm™3. Substituting 
this result in Eq. (2.79) we get To = 6.4 x 1073(hf)~1/?. Equation (2.77) 
now gives 











AT _ 2 kpTe 
T 2 


= —1.0 x 1074( fht). (2.81) 
mc? 


Though small, this is observable by current technology. Thus, the mea- 
surement of (AT/T) and modeling of f will allow an estimate of h which 
is extremely important in cosmology. 


2.5 Radiative processes in quantum theory 


We shall now take up the study of radiative processes in quantum theory 
which uses a somewhat different language. To work out the details of 
quantum radiative processes, we need to determine the relevant energy 
levels (which, in turn, will fix the frequency of photons which are emitted, 
through AE = fw) as well as the transition rate Q between the two levels. 
Let us begin with a determination of the energy levels of simple atomic and 
molecular systems. . 
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2.5.1 Energy levels 


The Hamiltonian describing a nonrelativistic electron, moving in the 
Coulomb field of a nucleus of charge Zq is given by Ho = (p?/2me) — 
(Zq?/r). If this electron is described a wave function (x, L) where L 
denotes the characteristic scale over which wy varies significantly, then the 
expectation value for the energy of the electron in this state is E(L) = 
(Y| Holp) ~ (A? /2m_L7) —(Zq?/L). The first term arises from the fact that 
(plp?) = —h2(|V?\w) ~ (h?/L?), which is equivalent to the uncertainty 
principle stated in the form p = A/L. This expression for E(L) reaches a 








minimum value when L is varied at Lmin = (@0/Z) with Emin = —Z7€a 
where 

h? x = mq 1 
a = a = = z 0.52 x 1078 cm; éa = T = zo mec” = 13.6 eV. 


(2.82) 
Here we have defined Àe = (h/me-c) and a = (q*/hc). The quantities ao 
and €a correspond to the size and ground state energy of a hydrogen atom 
with Z = 1. (These results are exact even though our analysis based on 
p œ~ h/L is approximate.) The wavelength À corresponding to the energy 
€a is À = (he/eg) = 2a7? e ~ 10° A and lies in the UV band. The fine 
structure constant a ~% 7.3 x 107? plays an important role in the structure 
of matter and arises as the ratio between several interesting variables: 


gp a AUB ast os Ke (2.83) 


where v is the speed of an electron in the atom, ug = (qħ/2mec) is the 
Bohr magneton representing the magnetic moment of the electron and ro = 
(q7/mec?) is the classical electron radius. The actual energy levels of the 
hydrogen (like) atom are En = —e,/n? with n = 1,2... 

The transitions between these energy levels in an atom will lead to pho- 
ton energies upwards of few electron volts and the corresponding wavelength 
will be in the optical and UV bands in most of the cases. Note that, while 
the characteristic size of the atom is ao, the typical wavelength of radiation 
emitted from the atom is à = (2/a)ao which is about 300 times longer, 
due to the smallness of a. The fact that the ground state energy scales as 
Emin x Z? is also of importance while considering atoms with large atomic 
numbers (See Ex.2.15). 

The actual Hamiltonian for an electron in the hydrogen atom is a bit 
more complicated than what we have used above and a more precise form 
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leads to some interesting effects. To begin with, the correct expression for 
kinetic energy is not (p?/2m,) but E = (p?c? + m2c*)!/2 — mec?. The 
expression for E(L) which we need to minimize is then 


~ [Re 24 ue 2 29 
E(L) = EE + Mec | —MeC — ae (2.84) 
The minimum value now occurs at 
h 1 1/2 
Lmin = — |- — 1 : 2.85 
Mel Er | ( ) 


As long as Za < 1, this will reproduce the previous results.? To see the 
effect of relativistic correction we can expand the Hamiltonian in Eq. (2.84) 
in a Taylor series in (p/mec). The lowest order correction will be 


4 2 2 2 72 
E~ E gt. ( p ) N (=) l x a Ea ~ 1073 eV. (2.86) 
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The zeroth order term Ho is of the order of (q?/ao) = 10 eV corresponding 
to À œ 1200 A, while this correction gives 10~* eV with the corresponding 
photon wavelength of A ~ 1 mm. (It is worth remembering that 1 eV 
corresponds to a photon wavelength of 12400 A.) 

There is another effect which leads to a correction of the same order 
of magnitude œ? Eo. This arises from a term in the Hamiltonian, Hsp-or, 
due to the coupling between the spin magnetic moment of the electron 
He ~ (qħ/2mec) and the magnetic field B & (v/c)E & (v/c)(Zq/r?) in 
the instantaneous rest frame of the electron, obtained by transforming the 
Coulomb field. This should have the magnitude 

z Z@ hu Zg 


Bpen Beana mza U s) (2.87) 





where l and s are the orbital (mevr) and spin (fi/2) angular momentum of 
the electron. (The actual result is half of this value with the extra factor 
arising due to an interesting phenomenon called Thomas precession which 


2 As an aside, let us note two curious aspects of Eq. (2.85). First, you see that Lmin 
will become imaginary for Z > a~} % 137. This is a real relativistic instability which 
arises in heavy ion collisions. Second, if we did the same analysis for a gravitating system 
made of N nucleons, then Za = Zq?/hc will be replaced by (GMmp/hic) = (Gm3/fic)N. 
Clearly we expect some instability when N ~ (ħc/Gm2). We will see this effect in chapter 
5 in the study of stellar remnants, though a precise analysis leads to the instability when 
Nw (Re/Gm2)3/2. 
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we will not discuss). This correction, with | ~ s œ A, is 


242 2 2 
ae ( h ) ZA Ea (2.88) 
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which is of the same order as the correction found in Eq. (2.86). These two 
corrections in Eq. (2.86) and Eq. (2.88) together are called fine structure 
corrections (since, historically, this is related to the fine structure observed 
in the spectral lines). 

The angular momentum conservation requires that a fine structure tran- 
sition will require Al = 1. In the case of hydrogen, one rarely finds an 
electron in the n = 2,1 = 1 state allowing a fine structure transition to 
the ground state. But the situation is different for heavier elements like, 
for example, singly ionized carbon C+. Since C+ has 5 electrons, the fifth 
one will end up at n = 2,l = 1 state; further, the fine structure energy 
difference for carbon is about 1072 eV because the expression in Eq. (2.86) 
scales as Z4. This corresponds to a temperature of about 50 K which exists 
in diffuse molecular clouds in the interstellar medium. Collisions between 
atoms can excite transitions between these levels. De-excitation will then 
occur through emission of radiation which escapes from the cloud. The 
net effect is the loss of kinetic energy of the atoms. Hence, fine structure 
transitions of heavier elements like carbon can be a main source of cooling 
in the interstellar medium. 

Finally, one also needs to take into account the direct interaction be- 
tween the magnetic moments of the electron and proton. This is represented 
by an extra term in the Hamiltonian 

ti 3r- i 
Asp—sp =. Be d E E n | 





(2.89) 


which gives the coupling between the magnetic moments of the nucleus and 
the electron. These magnetic moments are given by 
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where upg is the Bohr magneton, uy is the corresponding quantity for 
the proton and (s,S) are the spin vectors of electron and proton. This 
correction is of the order of 
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(corresponding to À ~ 10? cm) which is smaller than the fine structure cor- 
rection discussed above and is called hyperfine correction. A more precise 
calculation gives the wavelength of radiation emitted in the hyperfine tran- 
sition of hydrogen to be about 21 cm. This radiation, which is in the radio 
band, is used extensively in astronomy as a diagnostic of atomic (neutral) 
hydrogen. 

As a further example, consider the corresponding hyperfine transition 
in ŝHet. Since the frequency scales as Hnuc1/r?, we need to estimate how 
Lnuct and r vary when we go from hydrogen to singly ionized helium. Since 
magnetic moment of the proton has g/mp, we can take unuci « Z/A; further 
r scales as 1/Z. Hence we expect the hyperfine energy to vary as Z4/A. 
The wavelength of transition in helium will therefore be 21 cm x(A/Z*) = 
(21 x 3)/2* = 3.9 cm which is pretty close to the observed value of 3.46 cm. 

Let us next consider the radiation arising from the molecular energy 
levels. The simplest molecular structure consists of two atoms bound to 
each other in the form of a diatomic molecule. The effective potential 
energy of interaction U(r) between the atoms in such a molecule arises from 
a residual electrostatic coupling between the electrons and the nuclei. The 
resulting potential energy has a minimum at a separation r ~ ag (which is 
typically of the order of the size of the atom) and the depth of the potential 
well at the minimum is comparable to the electronic energy level, €a, of the 
atom. The atoms of such a molecule can vibrate at some characteristic 
frequency wy, about the mean position along the line connecting them. 
Treated as a harmonic oscillator, this will lead to the vibrational energy 
levels separated by Eyin © Awyip. If the displacement x of the oscillator 
is x ~ ao from the minimum, we expect the vibrational energy Eyip to be 
about €a. Writing €a ~ (1/2)uw?,,,a3 S (A? /meaĝ), where p is the reduced 
mass of the two atoms making the molecule, we can determine wyi,. We 
get 


Me 
u 





h2 
Eyib = ħwyib © x% 


1/2 
x% aœ0. 2. 
um) ia ) €a ~ 0.25 eV (2.92) 


if 4 ~ mp. This is the characteristic energy of photons emitted in vibra- 
tional transitions. 

Such a diatomic molecule can also rotate about an axis perpendicular 
to the line joining them. If the rotational angular momentum is J, this will 
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contribute an energy of about 


J? h? - 
Exot © (4) x (5) N (=) €a = 10 ™°ea ~ 107-2 eV (2.93) 


2 
Hag paĝ 


if J ~ hand u ~ mp. It follows from Eq. (2.92) and Eq. (2.93) that 
Exot : Eyib : Eo © (me/p) : (me/p)/? : 1 and Eyib © VeaEzor. Since 
(me/) ~ 1073, the wavelength of radiation from vibrational transitions 
are about 40 times larger than those of electronic transitions; similarly, the 
rotational transitions lead to radiation with wavelengths about 1000 times 
larger than electronic transitions. 

Taking 4 © mp, of course, is only true if you are thinking of Hz molecule, 
which unfortunately has no permanent dipole moment because of its sym- 
metry. Considering the fact that, after hydrogen and helium, some of the 
most abundant elements in the universe are carbon, oxygen, silicon ... ete., 
one might try CO molecule as a good choice. For CO, u © 30m, which gives 
the wavelength of transition to be Aco œ% 2.6 mm which is in the infrared 
band. 

The energy level structure of common atoms and molecules is of consid- 
erable importance for life on Earth in which the atmosphere shields most 
of the photons from reaching the ground. To begin with, very high energy 
photons (€ > 2mec? ~ 1 MeV) lose their energy due to production of ete7 
pairs when they pass through matter. (You can easily show that this pro- 
cess cannot take place in the vacuum since you cannot conserve both energy 
and momentum for the process 27 — ete~ in the vacuum; however, in the 
presence of a nucleus to absorb the momentum, this process can convert 
radiation into electron-positron pairs.) For photons in the energy range 50 
keV S e S 1 MeV, the most dominant energy loss process is Compton scat- 
tering with the electron. Since the atomic binding energies are usually small 
compared to this range of energies, the photon essentially sees the electron 
as “free” particle. Below about 50 keV, photo ionisation starts dominating 
over Compton scattering as the energy loss mechanism. This continues till 
the energy drops to about 3 eV. It turns out, miraculously enough, that 
there are no viable transitions in matter in the energy range of about 1.3 
eV X e <3 eV. This band of photons, therefore, can propagate through 
matter in the atmosphere fairly freely and constitutes the “visible” band 
of photons. Below 1.3 eV down to about 1074 eV, some amount of absorp- 
tion takes place because of molecular energy levels. Further down, we get 
another transparent window between 1074 eV and 1078 eV (corresponding 
to the wavelength range of 1 cm to 100 m; the radio band). Below 1078 
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eV, radiation cannot propagate through the ionospheric plasma (you will 
see why in chapter 3, Sec.3.4.2) thereby closing the window permanently. 
Thus ground based astronomy is mostly confined to optical and radio bands 
essentially because of the nature of atomic and molecular energy levels. 


2.5.2 Transition rates and cross sections 


The energy levels of atomic systems determine the frequency of radiation. 
But the intensity of radiation, related to the number of photons emitted 
per second, is determined by the transition rate Q (see Eq. (2.9)), which 
we shall now estimate for simple cases. 

The basic rate, governed by Q, can be estimated using the correspon- 
dence with classical theory. If the rate of transition between two energy 
levels is Q and the energy of the photon emitted during the transition is 
hw, then the rate of energy emission is Qħw. Classically, the same sys- 
tem will emit energy at the rate (d?w*/c*) where d is the electric dipole 
moment of the atom. (See Eq. (2.8)). This assumes that the radiation 
is predominantly due to direct coupling between the electric field and the 
dipole moment of the atom; if not, you have to use the relevant moment — 
like the electric quadrapole moment or magnetic dipole moment, instead of 
d — in this equation. Writing Qhw ~ (d?w‘/c?), we find that 


wd? aANT œ (mec? Gg 








where we have used d = gag œ (qA-/a) and hw ~ (1/2)a? mec?. The rate 
Q can also be expressed in different forms as 


2 2 2 
q 4 3 q 2 270 
zx > sw = w’ = w*— 2.95 
Q hc 2 Mec? c ( ) 





where ro is the classical electron radius. A more precise quantum mechan- 
ical calculation corrects this by contributing an extra numerical factor f, 
called oscillator strength which is usually of order unity for strong lines. 
The transition rate for other processes, like e.g. the 21 cm radiation, 
can be estimated in the same manner using Eq. (2.8). In the relation 
Qħwz =~ (w4,d?/c3), we now have to use the magnetic dipole moment of 
electron d ~ (qħ/mec). This gives Q ~ a(X-/c)*w3,. Estimating hwo: from 
the expression (2.91) derived above for the hyperfine structure energy level, 
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this transition rate can be evaluated to be 


Tow hw 
Qorem ~ ( 2 2) ( z) w ~ 107}5 s71, (2.96) 


c Mel 








The ratio between this rate and the transition rate between the primary 
energy levels of hydrogen atom, computed earlier in Eq. (2.94), is 


cm 3 Xe ? 
Qim (=) (=) x 107%, (2.97) 


Q w ao 


Clearly, hyperfine transitions are very slow processes. It would not have 
been useful astrophysically but for the fact that there are so many neutral 
hydrogen atoms out there. 

Let us verify these ideas with two more examples. First, consider the 
corresponding hyperfine transition in “Het for which we estimated the 
wavelength of radiation to be 3.9 cm earlier (see discussion after Eq. (2.91)). 
Since Q varies as w œ A~%, we will expect the rate of hyperfine transition 
in ?He* to be (21/3.9)? times larger than the Qo; cm. This gives about 
6 x 10-!8 s71. This is off from the true value only by a factor three. As 
a second example, consider the molecular transitions in CO which is given 
by an expression similar to Eq. (2.94) with d representing the permanent 
dipole moment of the molecule and w = 2mc/Aco with Aco ~% 2.6 mm as 
estimated earlier (see discussion after Eq. (2.93)). It turns out that the 
dipole moment of CO is 0.1 debye where 1 debye = 10718 in the c.g.s units 
for measuring dipole moment. This is a convenient unit because qao = 2.5 
debye. (The lower dipole moment of CO is a peculiarity arising from the 
strong double bond connecting C and O — most other molecules like H20, 
SiO, Cs ..... all have dipole moments which are of the order of 1 debye 
which is what one would have naively expected.) Using these values we get 
a transition rate of Qeo = 8.7 x 1078 per second which is pretty close to 
the observed value of 6 x 1078 per second. 

One can also estimate the width of the spectral lines from the decay 
rate. Since every excited state has a probability to decay spontaneously to 
the ground state with a decay rate Q, the lifetime At of the excited state 
is about 


2\ 71 —2 8 
At~ Qo) x (£) (>) wo) (=) ~ 1079 s (2.98) 


where the numerical estimate is for the optical radiation. (It is correspond- 
ingly larger, and is about At = Q3] em = 10" s for the 21 cm radiation.) 
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From the uncertainty principle between the energy and time, it follows that 
the energy level of the excited state will be uncertain by an amount AE 
of the order of (h/At) ~ AQ. In the absence of such an uncertainty, the 
transition between the two energy levels could result in an infinitely sharp 
spectral lines at a specific frequency w. The width of the excited states 
leads to a corresponding width to the spectral line called the natural width 
Aw where (Aw/w) = (Q/w) % 1078 for the main hydrogen lines. In terms 
of wavelength, the natural width AA = 27c(Aw/w?) is of the order of the 
classical electron radius: Aà % 2rro providing another interpretation for 
ro. 

The same rate Q also governs the absorption of radiation at the tran- 
sition frequency by the system. To take into account the finite line width, 
it is convenient to introduce a (frequency dependent) bound-bound cross 
section gpp (w) for the absorption of radiation by the system. The function 
Obp(w) is expected to be sharply peaked at w = wo with a width of Aw. 
If the phase space density of photons is dN = n{d°ad3p/(27h)3], then the 
number density of photons per unit volume participating in the absorption 
is N = nd?p/(2rh)> = (nw%/mc?)(Aw/27) if we assume that the photons 
in the band Aw around wọ are absorbed. The corresponding flux is Nc and 
the rate of absorption is 


Aw 


2 
I RUO 8 ne 
N Tbe = Te Obb on (2.99) 
which we know should be equal to 
DPEN EO. 
Qn = w2 ( ; ) n (2.100) 


giving opp(Aw/27) = mroc. Since we expect opp to be a sharply peaked 
function at wo with a width Aw, this result can be stated more formally as, 


Mel To 








This — in turn — suggests expressing the cross section as 
2 
mq 
= — 2.102 
Orb (w) ma” ( ) 


where ĝu is called the line profile function; the integral of ¢,, over (dw/27) 
is unity to satisfy Eq. (2.101). When the line widths are ignored, ¢,, is 
proportional to the Dirac delta function; when the finite width of the energy 
levels is taken into account ¢, will become a function which is peaked 
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at wo with a narrow width Aw, such that, at the transition frequency, 
(wo) S (Aw)—!(27). The absorption cross section at the center of the line 
w = wo is given by o œ 2r?(rc/ Aw) ~ (A2/2) where Ao is the wavelength of 
the photon. A more rigorous quantum mechanical theory of transition rates 
is needed to determine the explicit form for #(w) and overall multiplicative 
numerical factor f called the oscillator strength. It can be shown that the 
functional form of ¢(w) is a Lorentzian so that the final cross section in 
Eq. (2.102) is usually written in the form: 

mg? m(q?/me)l f 


Tbb = TE fho z ( 


The line width described above is usually called intrinsic or natural line 
width. (The Lorentzian profile can be most easily understood by treating 
the dynamics of the electron in an atom as that of a damped harmonic 
oscillator with a natural frequency wo and damping constant T; such a 
classical model captures the essence of quantum mechanical calculations 
with surprising accuracy.) 


2.5.3 Ionisation and recombination 


So far we discussed the transitions between bound states with discrete en- 
ergy levels. Another important class of radiative process involve transitions 
between a bound state of an atom and the continuum. We shall now discuss 
this process, and the related opacities. 

To study these processes which essentially corresponds to a photon ion- 
izing an atom, we will begin by relating the photo-ionisation rate to the rate 
for the time reversed process, which is recombination rate in which an ion 
and electron gets bound together, releasing the excess energy as radiation. 
The recombination rate per unit volume of a plasma will be proportional 
to (a) number density of electrons, ne, (b) number density of ions, n;, (c) 
the relative velocity of encounter, v and, finally, (d) the cross section for 
the process, which will be about 7l?, where | gives the effective range of 
interaction between electron and proton. There are two possible choices 
for |. From Coulomb interaction, we get a characteristic length scale for 
charged particles approaching each other with relative velocity v, which is 
about lı S (2Zq? / Mev); this is the scale at which electrostatic interaction 
energy becomes comparable to the kinetic energy. On the other hand, an 
electron of speed v has a de Broglie wavelength of l2 S (h/mev) and will 
make its presence felt within a region of this size. Clearly should choose | 
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to be the larger of lı or l2 depending on the context. 

Let us first consider the case with | = l2, which corresponds to (u/c) Z 
(q?/hc). Since each recombination releases an energy of about (1/2)mev? 
we can estimate the rate of energy release per unit volume to be: 


dE Mev? h V? ; 
rec e , pl/2 o „21/2 
(Fe) x ( 5 ) (neni) (=) vx neni T" x pT (2.104) 


where we have assumed that mev? ~ kgT,ne x p,ni x p. In equilib- 
rium, the photo-ionisation rate (which removes energy from the radiation 
field) should match the recombination rate. The amount of energy removed 
by photo-ionisation is proportional to dEion X NatomdEragd where ø is the 
photo-ionisation cross section which needs to be determined. Equating 
dEjon to dErec and using Erag «x T4, we get 








Natom o T4 « nen: T!/2 (2.105) 


so that o « pT~°5. Introducing the bound-free opacity Kpt by the definition 
Kof = (Natoma /p) and taking ne x p, ni x p we find that 


kat x pT 35 (2.106) 


which scales just like the free-free opacity in Eq. (2.48). 

In a radiation bath with temperature T, the typical energy of photons is 
hv ~ kpT. The result Eq. (2.106) suggests that, in a more general context, 
the frequency dependence of this cross section will be of the form o(v) = 
ove(v/vr)~* for v > vyr (and zero otherwise) with s ~ 3 and v7 = (€a/h) is 
the frequency corresponding to the ionisation energy €a of the atom. The 
prefactor opf can be estimated using the fact that any absorption cross 
section o(v) satisfies the constraint Eq. (2.101): 


. o(v) dv = nrocf = ua f (2.107) 
i ( 


Mec 





where the term within the bracket comes from classical theory and the 
oscillator strength f needs to be supplied by the full quantum theory. Using 
this, we find: 


Obf = m(s = 1)f roàr = 2r froàr (2.108) 


with Az = (c/vr),s ~ 3. Therefore, the photo-ionisation cross section is 
essentially the product of the classical electron radius and the wavelength 
at ionisation threshold. Using A; = 912 A, we get opf = 10717 cm? for 
hydrogen. (Incidentally, this shows that a column density N, of hydrogen 
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which leads to unit optical depth, Neopr = 1, is given by Ne ~ opp œ 10!7 
cm~?, This is useful in studying quasar absorption systems; see Chapter 
9.) For heavier elements o will scale as 


o(v) = Ove (27 « fro (<) (#)° x (5) x Zv’ (2.109) 


if we take s = 3. 

Given the form of øf, we can determine the cross section for recom- 
bination, Grec. Recombination can occur with an electron having any mo- 
mentum while the ionisation can be caused by photon of any energy. Equi- 
librium therefore requires 4c,;¢d°p. = 80p¢d°p where 4 = 2 x 2 is due to 
electron and proton spins, 8 = 2 x 2 x 2 is due to electron, proton and 
photon spins and pe, py refer to the momenta of the electron and photon. 
If the momentum of the photon arises from that of the electron in the re- 
combination, we have dpe ~ dpy. This gives Grec = 2op¢(py/Pe)?; if the 
plasma is at some temperature T, then 


2(ħw)? Ea Ea -29 [ kBT si 2 
rece = 57 Na = 2| — —— ~ 10 — x 
j Elment ™ (45 (Ż Tee 10 eV a 
(2.110) 


The rate of recombination per unit volume per second is n?c;ecv = n?aR 
where ap (called recombination coefficient) is given by 


OR = Orec¥ © 2 x 1074 (kgT/10 eV)? cm? s7}, (2.111) 


This result is extensively used in the study of astrophysical plasmas. 

In the analysis so far we have assumed that the recombination proceeds 
directly to the ground state. Such a process, however, will release a photon 
with energy hw > €a which can immediately ionize another atom. The net 
recombination usually proceeds through electron and proton combining to 
form an excited state of the atom which decays to ground state later on. 
All the photons emitted in the process will then have hw < €a and cannot 
ionize other neutral atoms in the ground state. In this case, the speeds are 
usually low (v/c < q?/hc) and we can write a relation similar to Eq. (2.104) 
but with | = lı = (2Zq?/m,v?). Then the energy loss due to recombination 
is: 





dE vec 1 2 Zg 5 -1 2-1/2 
= D) (1 i T i 
(T) x (gme ) en) (ea) VX NNU XN 
(2.112) 
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This can be a source of cooling for plasma at T > 104 K, in addition to 
the bremsstrahlung cooling (dEbre/dVdt) « n?T1/? which was obtained 
earlier in Eq. (2.43). Since both processes scale as n? the total rate of 
cooling for a plasma can be expressed in the form (dE/dVdt) = n?A(T) 
with A(T) = aT!/? + bT-1/2 where a, b are numerical constants; the first 
term due to Bremsstrahlung dominates for T > 10° K. The function A(T) is 
called the cooling curve of the plasma and it incorporates the effects of both 
line cooling as well as continuum radiation in the form of Bremsstrahlung. 
Incidentally, at very low temperatures, neither process is effective and the 
gas simply cannot cool since collisions cannot excite the atomic energy 
levels. To capture the behaviour at low temperatures, one needs to take 
into account the fact that the gas will be mostly neutral. A reasonable 
fitting formula for the hydrogen plasma, taking all these effects into account 
is given by 


10-24 


AD) = [1+ ar exp(€o/kpT)|? 


[2175 164 9.447 "| (2.113) 


where €a = 13.6 eV, a = 2.8 x 10° and all quantities are in cgs units with 
Ts = (T/10° K). 

As a specific example involving photo-ionisation and recombination, let 
us apply these ideas in the case of a gas of hydrogen atoms, ionized by a 
flux of radiation F from some source (like the HII regions around stars). 
Let NH, Ni, Nne be the number densities of hydrogen atoms, protons and 
electrons respectively. The steady state will be reached when the rate 
of ionisation, o»fny F, is balanced by the rate of recombination, nenjar. 
Writing neniarR ~ opfnyF where F is the flux of ionizing photons with 
frequency higher than the ionisation threshold (with v > vr) and taking 
Ne = Nj = Tno, NH = (1 — z)no, we get: 


r?  fonec F 4 T \ eR 
a = T) = (=) (=) ~5x10 (ak z) (=) è (2.114) 


This relation determines the ionisation fraction x in many astrophysical 
contexts. If a source of photons emitting Ny photons per second (with 
v > vr) ionizes a region of volume V around it, then the same argument 
gives n2arRV ~ Ny. Taking ne = zno œ No, we can determine the volume 
of the region which is ionized to be V = (N,/apn}). 

To get the feel for the numbers involved, consider the region around a 
hot star with luminosity L = 3.5 x 1036 erg s71 and a surface temperature 
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T, ~ 3x 10* K. The number of ionizing photons, Ny, (with v > vz) emitted 
by such a star can be estimated by integrating the Planck spectrum, from 
v = vz to vy = oo, and will be about 3 x 104% s~!. Using the result 
V = (N,/an) obtained above, with no ~ 10 cm~3, we find that matter 
will be fully ionized for a region of radius R = (3V/47)'/3 ~ 10 pe. At a 
distance of 5 pc from the star Eq. (2.114) gives (1 — x) ~ 107° indicating 
nearly total ionisation. The radiation from such HII regions is one of the 
probes of the interstellar medium. 

Finally, we mention another important complication in the case of 
bound-free transitions of hydrogen atom which has astrophysical signif- 
icance. It is possible for the neutral hydrogen atom to capture an- 
other electron and become a negative hydrogen ion. This HT ion isa 
a bound state made of two electrons and a proton, with a binding en- 
ergy of about 0.75 eV. (The presence of an extra electron polarizes the 
hydrogen atom leading to a residual electric field in the vicinity which, 
in turn, binds the electron.) The bound-free absorption from this state 
gives rise to continuous opacity at wavelengths shorter than 16502 A corre- 
sponding to the energy 0.75 eV. For conditions prevalent in sun-like stars, 
the opacity due to hydrogen ion can be approximated by the expression 
kg- œ% 2.5 x 10791(Z/0.02) p!/?T® cm? g`! where Z is the fractional 
abundance of elements heavier than H (which usually supplies the extra 
electron) and all the quantities are in c.g.s. units. You will see in Chapter 
4 that this can be a very significant source of opacity in stars like the Sun. 


2.5.4 Spectral line profiles 


We saw earlier that the spectral lines have a natural line width which is 
described by a Lorentzian line profile [see Eq. (2.102)]. In astrophysical 
contexts, there are two other processes which contribute to the line widths. 
The first is due to the collisions between the atoms while the second is a 
Doppler shift arising from the random motion of atoms in thermal equilib- 
rium. 

The collisions will change the phase of the electric field of the radi- 
ation in a random manner, so that the electric field will now become 
E(t) = Eoeot-Ft/2et¥ where w(t) is a random function of time such 
that it changes completely randomly at each collision and remains the 
same between collisions. The net effect of averaging e*” over such colli- 
sions is to introduce a term exp(—v-t) into the decay rate of excited atoms 
where ve is the collision rate (see Ex. 2.16). Therefore collisions can be 
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accounted for by the same line profile as in Eq.(2.103) with the replacement 
1/2 = T/2 + ve = T" /2. 

The thermal broadening of the lines, due to the Doppler effect, leads 
to a different line profile: In thermal equilibrium, the probability distri- 
bution for the line-of-sight velocity (say, vs) is a Gaussian proportional to 
[exp — (v2/vp)] with vp = (2kgT/m) where m is the mass of the atom 
emitting the radiation. When an atom moving with velocity vz emits light, 
there will be a Doppler shift (Aw/w) ~ (vz/c). Hence the line profile will 


be 
Ilw) CRONAN (2.115) 
w) x exp a 7 : i 
The fractional line width (Aw/w) is (vr/c). Since vr ~& 


0.1km s7! (T/1 K)\/? for hydrogen and (Aw/w) ~ 1077 for the natural 
line width, it is clear that Doppler width will be larger than the natu- 
ral line width for vr > 1077c œ 0.003kms~*. So one might think that 
this will completely dominate the natural and collisonal line widths. How- 
ever, the Gaussian width due to Doppler effect falls much more rapidly 
than the Lorentzian line profile. As you can easily verify, a Lorentzian 
A?(z? + A?)—1 will dominate over a Gaussian exp(—x?/20?) for (x/o) > 5 
even if A ~ 1073s. Hence the natural width will dominate over the Doppler 
width away from the center of the line. 

When all the three sources of line broadening are present, the net line 
profile can be determined by taking the Lorentzian profile due to natural 
and collisional broadening and convolving it with the probability distribu- 
tion for the velocities. Since the probability for the velocity to be vz is given 
by a Maxwellian distribution at some temperature T, the net line profile 
can be easily shown to be 





Pror(w) = ain | aoa (2.116) 
where 
Be ay) Eo ee) Pee) 
(2.117) 


The function in the right hand side of Eq. (2.116) is called the Voigt function 
which is Gaussian near the origin and Lorentzian away from the origin. A 
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good approximation to the Voigt profile with correct asymptotic limits is 


$= (5) ier i | ; (2.118) 


Awp TU? 





This profile is used extensively in the study of spectral lines from astro- 
physical systems. 


Exercises 


Exercise 2.1 Numerical estimates for light: 


Estimate (i) the total intensity J.dv (in Watts cm~’), (ii) amplitude of electric 
field (in Volts cm~') and (iii) number density of photons in logarithmic band of 
frequency, ny (dv/v), for: (a) A mercury lamp, (b) Laser. 


Exercise 2.2 Aspects of intensity: 


A spherical source of radiation (with radius R) has a uniform intensity IL. (a) 
Show that the total flux of radiation from the source at a distance r from the 
center of the source will be 


R\?2 
S, = J I, cos @dQ = rly (2) . (2.119) 
R rT 


(b) The flux radiated from the surface (at r = R) is mI, while the total solid 
angle is 4m. Where did the factor 4 vanish? 


Exercise 2.3 How dark is it? 


Compare (i) the energy incident on your retina per second when your eye is closed 
due to photons inside your eye (by treating the cavity of the eye as a black body 
at body temperature) with (ii) the energy flux of photons in your eye when you 
are viewing a 100 W bulb at a distance of 1 m. Explain the result. (Hint: You 
will find that (i) is larger than (ii) by a factor of about 6500. The reason it is 
dark when you close your eye has to do with the frequency band in which the two 
radiations peak and the sensitivity of the eye rather than to the incident energy 
densities. If you compute the energy density between (400 ~ 800) nm in the two 
cases, you will find that the light bulb dominates.] 


Exercise 2.4 Power in/of radio astronomy: 


(a) Consider a radio telescope of collecting area A (about 7150” m? for Arecibo) 
that is used to observe Jupiter (brightness temperature T = 122 K) at the fre- 
quency v = 30 GHz. Compute the total power in Watts received from Jupiter 
which has a radius ry œ 7.1 x 10* km and is at a distance of d œ 4.2 AU. [Hint: 
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Jupiter subtends a solid angle 2 œ (rz/d)?. In the Rayleigh-Jeans approxima- 
tion, the specific intensity is B, = 2kgT/X? Jy ster~'; the observed flux density 
is BLQ and the received flux is vB, 9. Hence the total power received is vB, QA.] 

(b) Why should stimulated emission be important in the radio but not in 
ultra violet? 


Exercise 2.5 By Jove, a planet: 


Jupiter can be approximated as a blackbody with a temperature of 122 K. If it 
has a radius of 7.1 x 10* km, calculate the bolometric luminosity of Jupiter. How 
does it compare with that of Sun? Suppose you have a satellite based telescope, 
operating at the wavelength at which Jupiter’s emission is peaked, and is sensitive 
up to apparent magnitude of 30 (this is typically the sensitivity Hubble Space 
Telescope has but, of course, Hubble is sensitive at visible band). Will a planet 
like Jupiter be bright enough for detection at a distance of 10 pc? If the angular 
resolution of the telescope is 0.1 arcsec, what is the maximum distance from which 
it can resolve Jupiter and Sun (they are separated by 5.2 AU)? 


Exercise 2.6 Doppler effect on temperature: 


An observer is moving with speed v through a region containing isotropic black- 
body radiation of temperature T. Show that the intensity of radiation measured 
by this observer, in a direction making angle 0 with respect to the direction of 
motion, will be Planckian in form but with a Doppler shifted temperature 


T' =T (GES : (2.120) 


1 — (v/c) cos 0 


(Incidentally, since T” depends on 9, this result shows that there is no such thing as 
‘Lorentz transformation law for temperature’.) (a) Expand this result in a Taylor 
series in v/c and show that the lowest order term is a dipole. The motion of our 
local group through the Cosmic Microwave Background Radiation (CMBR) will 
lead to such a dipolar anisotropy of the CMBR temperature. Assuming that this 
anisotropy has an amplitude of about (AT/T) = 107°, estimate the speed with 
which the local group is moving through CMBR. (b) Show that, to quadratic order 
in (v/c), you get a quadrupole moment for temperature distribution. Determine 
the direction at which the quadrupole produces maximum temperature and its 
amplitude. 


Exercise 2.7 Going forward: 


Consider a source of radiation moving at v/c = 0.999. Show that: (i) radiation 
originally emitted at 45 degrees to the direction of motion is seen within 1 degree 
of the direction of motion. (ii) Let us assume that there was no radiation at 90 
degrees to the direction of motion. (This is what will happen, for example, if the 
charge is moving in a circular path with the acceleration being perpendicular to 
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the velocity.) Show that this “null direction” is at 2.5 degrees with respect to the 
direction of motion. 


Exercise 2.8 Clouds and sunsets: 


In an one dimensional geometry with genuine absorption of radiation,the intensity 
decays exponentially with the optical depth as e~*. But if the medium is only 
scattering the photons rather than actually absorbing them, then the situation 
is different. Since the photons are just performing one dimensional random walk 
in this case, it will satisfy the diffusion equation. The steady state solution to an 
one dimensional diffusion equation d?n/dl? = 0 varies linearly with l. Use this 
and appropriate boundary condition to show that the transmission coefficient of a 
rectangular slab of material decreases only as r~! (But for this, cloudy days would 
have been much darker.) Do you think similar results will hold in 3-dimensional 
geometry? 


Exercise 2.9 Pushed away lightly: 


Consider an optically thin cloud of material that is ejected by the radiation 
pressure from a nearby luminous object with a mass-to-luminosity ratio M/L. 
Show that the terminal velocity acquired by the ejected blob is given by 


5 (2.121) 


2_ 2GM K Lo, 
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Hence obtain the bound on M/L for the ejection to occur. 


Exercise 2.10 Temperature is not everything? 


The HII regions have temperatures very similar to effective temperatures of stars. 
Then how come they don’t shine in the visible band as brightly as stars? 


Exercise 2.11 Dissecting Orion nebula: 


Orion nebula has a radio flux which is F œ p? for v < ve % 1 GHz and a 
nearly flat flux for v > ve. At v = ve, the flux is about Fe œ 300 Jy. The 
nebula subtends 4 arcminute at a distance of 500 pc. You are expected to model 
this as bremsstrahlung arising from a region of ionized hydrogen. (Hint: Use 
Eq. (2.47).) Given the data on Orion nebula, estimate the temperature and the 
electron density. 


Exercise 2.12 Deep trouble: 


An electron moves in a circular orbit under the influence of a magnetic field. 
Because this is a accelerated motion, the electron dutifully emits synchrotron 
radiation. But, wait a second. Where does this energy come from, since magnetic 
field cannot do any work on charged particles? 


92 An Invitation to Astrophysics 


Exercise 2.13 Cosmic rays: 


Electrons in the cosmic rays have been detected with energies up to 3 x 107° eV. 
The diffuse magnetic field in the Local Group of galaxies (made of Milky Way, 
Andromeda and a few tens of other smaller galaxies), over a region of size of 1 
Mpc, is about 107” Gauss. What is the synchrotron energy loss time compared 
with the time to cross the Local Group, for the cosmic ray electron traveling 
perpendicular to the magnetic field. For comparison, compute the same for the 
galactic disk with a size of 1 kpc and magnetic field of 3 x 107 Gauss. What is 
the moral of the story? 


Exercise 2.14 Angular size of synchrotron source: 


Argue that if Compton losses should not be too large, then the angu- 
lar size of a synchrotron source of total flux F, is bounded by @ > 
107? arc sec (FL/Jy)*/? (ve/GHz)~9/"°. 


Exercise 2.15 Ironing it out: 


Compute the energy difference between n = 1 and n = 2 energy levels of an iron 
atom which has been stripped off all but but one of its electrons. (This is the 
Ka line used in x-ray astronomy.) Also find the ionisation energy of this system. 
(This is called the K-edge.) 


Exercise 2.16 Collisions and the Lorentzian profile: 


Using the argument given in the text, prove that collisions also lead to a 
Lorentzian profile for the line width. (Hint: What is the Fourier transform of 
6(t) exp(—T't/2)?] 


Exercise 2.17 Paradox? 


We argued that surface brightness is independent of distance. Then how come 
we use an inverse square law to describe the decrease of stellar flux? Why isn’t 
stellar surface brightness constant making the flux independent of distance? 


Exercise 2.18 Actually, it is quite simple: 


When photons and electrons interact through Compton and inverse Compton 
scattering, the evolution of photon number density n(v, t) in the frequency space 
is described by the equation 





On hnes: P 1 ð 4 kBT On 2 
== ss 2.122 
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called Kompaneets equation. This, rather formidable looking, equation is quite 
easy to obtain if you go about it the right way. (a) Show that Compton processes 
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conserve the number of photons requiring the evolution equation to have the 
general form 


On _ 10 2 

at v2 Ov 
where j(v) is the “radial” current in the moment space. (b) Using the results 
obtained in the text, you can now show that there are two pieces to this current. 
The Compton scattering, leading to (6v/v) ~ —hv/mec’, will give a drift velocity 
dv/dt = —neorhv?/mec. Show that this gives a current ndv/dt. (b) Argue that, 
in addition, there is also a diffusion in the frequency space giving j = —Ddn/dv 
with the diffusion coefficient D = (1/3)neorc((Av)”). Finally show that Av = 
v(v/c) and combining the two terms, derive the Kompaneets equation. 


jw) (2.123) 


Notes and References 


There are several text books which deal with radiative processes in great detail. 
For conceptual clarity, the best one is Landau and Lifshitz (1975a). Astrophys- 
ical applications are stressed in Shu (1991) and Rybicki and Lightman (1979). 
Detailed derivations of many of the results that are quoted without proof in this 
section can be found in these references. 

1. Section 2.1.1: This derivation of radiation field is based on Thomson 
(1907). A more accessible reference in which this derivation is repeated is Berkeley 
Volume 3, Crawford (1968). 
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Chapter 3 


Matter 


3.1 Equations of state 


In the study of different astrophysical systems, you have to deal with matter 
differing widely in its physical properties. Take, for example, the density: 
the intergalactic medium has 10~28g cm~%, sun-like star has 1g cm~* and 
a neutron star has 10'4g cm~* — with a dynamic range of 42 orders of 
magnitude! Similar variation occurs in many other parameters and — as a 
consequence — astrophysical systems come with matter existing in all the 
four different states. How do we tackle this enormous diversity ? 

It turns out that, by and large, the description of matter can be simpli- 
fied by the use of equation of state. (There are systems with large number of 
particles — like galaxies made of stars — for which one cannot write down 
an equation of state; such systems are, indeed, more difficult to model.) 
In general, equation of state allows you to determine the pressure in terms 
of other physical parameters, most importantly density and temperature. 
Since pressure determines the force balance, which determines the motion 
and dynamics, we can make considerable progress once we know the equa- 
tion of state. 

Let us begin with a system of N particles (N > 1), each of mass m, 
occupying a region of volume V. The pressure exerted by a sufficiently 
large number of particles can be obtained as the momentum transfered per 
second normal to a (fictitious) surface of unit area (which we take to be in 
the yz plane). The contribution to rate of momentum transfer (per unit 
area) from particles of energy € will be n(e)pz(€)uz(€) where n(e) denotes 
the number of particles per unit volume with momentum p(e) and velocity 
v(e). The net pressure is obtained by averaging this expression over the 
angular distribution of velocities and summing over all values of the energy. 
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Since the momentum and velocity are parallel to each other, the product 
Pos, averages to (1/3)pv (in 3-dimensions) giving 


P= : i * WA pOUOE (3.1) 


where the integration is over all energies. This result is of very general 
validity since it is purely kinematical. 

Of course, you can do something with it only if you know the functions 
p{e) and n(e). This is easiest for the systems called ideal in which the kinetic 
energy dominates over the interaction energy of the particles. In that case 
e is essentially the kinetic energy of the particle. Further, since we may 
want to deal with both relativistic and nonrelativistic particles, it is good 
to use the exact expression for kinetic energy. Using the relations 


y2\ 1? 
p=ymu; €=(y—-1)me?; y= (2 — =) ‘ (3.2) 


where € is the kinetic energy of the particle, the pressure can be expressed 


in the form 
o0 2 2\ 71 
P= F ne (1 + ae ) (1 + ze) de. (3.3) 
3 Jo € € 


This expression simplifies significantly in the following two limits: non- 
relativistic (NR) and extreme relativistic (ER). In the NR limit (with 
mc? > e), Eq. (3.3) gives Pyr œ% (2/3)(ne) = (2/3)Unr where Unr is 
the energy density (i.e., energy per unit volume) of the particles. In the ER 
case (with € >> mc? or when the particles are massless), the corresponding 
expression is Per % (1/3)(ne) = (1/3)UgR. Hence, in general, P ~ U up 
to a factor of order unity. In these two limits, the kinetic energy has the 
limiting forms: 


2 . 
c= Pat md — me = f A E 





pe (p > mc; ER). 


To proceed further and determine the actual behaviour of the system we 
need to know the origin of the momentum distribution of the particles. The 
familiar situation is the one in which short-range interactions (‘collisions’) 
between the particles effectively exchange the energy so as to randomize 
the momentum distribution. This will happen if the effective mean free 
path of the system l is small compared to the length scale L at which 
physical parameters change. When such a system is in steady state, one 
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can assume that local thermodynamic equilibrium, characterized by a local 
temperature T', exists in the system. Then, the probability for occupying a 
state with energy E is given by the usual Boltzmann distribution, P(E) œ 
exp[—(£/kgT)], and the mean energy will be of the order of kgT. Using 
Eq. (3.4) with € ~ kpT we get: 


QmkpT)/? (kBT & me; NR 
e at ste NE) (3.5) 


~ [(keT/c) (kBT > me; ER). 


In this case, the momentum and kinetic energy of the particles vanish when 
T-—- 0. 

This limit contains, in particular, the important special case of a non 
relativistic ideal gas with the equation of state PV = NkgT which cor- 
responds to € = (3/2)kgT and Ung = Ne. Considering its importance, 
we will a spend a moment, recalling its key properties. Depending on the 
context, astronomers write the equation of state for ideal gas in several 
equivalent forms: 





N Nmy kB R 
= Pe ee wee = —(pT $ 
Payg A m ee) 
with 
_ Vp. _ Ap _ 
H= Nmi’ R= my = Nakpg (3.7) 


where we have defined the gas constant R = (kg/myu) = Nakpg, the Avo- 
gadro number N4 = my and the mean molecular weight u. The inverse of 
the molecular weight gives the effective number of particles per hydrogen 
atom of the gas. Clearly the value of u depends on the number of particles 
contributed by each atom which, in turn, will depend on whether the atom 
is ionized or not. (Note that both free electrons and ions contribute to 
pressure while the mass is mostly contributed by the ions.) It is therefore 
easy to see that 





1 EEN 
arr | A; ) Xs 


where A; is the atomic weight of the jth species of the particles, X; is 
the fraction of the total mass contributed by the jth species and Z; is the 
effective number of electrons supplied by each atom of the jth species. If 
we are interested only in the pressure contributed by electrons (or ions), we 
can use the equation Eq. (3.6) with u replaced by Hele (or Hion) respectively. 
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For example, the number density of electrons in a hydrogen plasma with 
the ionisation fraction ry will be 


Ne = (4) ur! & (56) (1+ Xx) (3.9) 


MH 


which defines 4e and is a useful relation in estimating processes related to 
electrons. Finally, we recall for future reference that the entropy of an ideal 
gas can be expressed in the form: 


S=Cyb[TVO-Y) œ jT ODV] & InfPV] x Inf[PTVO-Y} (3.10) 


where y = Cp/Cy is the ratio of the specific heats. For an ideal mono- 
atomic gas, y = 5/3. It follows that for adiabatic processes, PV” as well 
as the other combinations inside the square brackets in Eq. (3.10) are con- 
stant. The equivalent relation, P « p7, for adiabatic processes, will be used 
extensively in the later chapters. 

All this is fine for classical particles for which momentum and energy 
vanish at zero temperature. Quantum mechanical fermions, like electrons, 
behave differently. The mean energy of a system of electrons will not 
vanish even at zero temperature, because electrons obey Pauli exclusion 
principle which requires that the maximum number of electrons which can 
occupy any quantum state is 2, one with spin up and another with spin 
down. Since uncertainty principle requires AxAp, Z, 27h, we can as- 
sociate (d°xd°p)/(27h)* micro-states with a phase space volume dz d°p. 
Therefore, the number of quantum states with momentum less than p is 
[V (4rp?/3)/(2rħ)’] where V is the spatial volume available for the sys- 
tem. The lowest energy state for a system of N electrons will be the one 
in which the electrons fill all the levels up to some momentum pr (called 
Fermi momentum). This gives, 


n= (>) = eee = a (y (3.11) 


so that pr = A(3r?n)!/3. Hence, at T = 0, all the states with p < pp = 
A(3r?n)!/3 will be occupied by electrons; that is, even at T = 0 electrons 
will have nonzero momentum (and energy). Further if pr is of the order of 
or greater than mc (which can happen if n is large enough), then a fraction 
of electrons will be relativistic, even at T = 0! 

The above analysis, of course, is strictly valid only at T = 0. At finite 
temperatures, electrons will also have a thermal energy kgT which should 
be compared with the energy ep (called Fermi energy) corresponding to 
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pr to decide whether the system is classical or quantum mechanical. The 
quantum mechanical effects will be dominant if eF Z kgT (and the system 
is called degenerate) while the classical theory will be valid for er < kgT 
(non degenerate). This energy er will, in turn, depend on whether the 
system is NR or ER: 


_{ B= (B) Grn)” R) 

gpa ie (3.12) 
pre = (hc) (37?n) (ER). 

(Note that the result pp = A(3r?n)!/3 is independent of whether the par- 
ticles are relativistic or not. But the relation between the energy and the 
momentum has different limiting forms in these two cases and hence the 
dependence of ep on n is different.) The ratio (er/kgT), which determines 
the degree of degeneracy (that is, how quantum mechanical is the system 
under consideration) is 





( z ) = oe os (3.13) 


kpT 2)" (Ep fic_» 1/3 (ER) 


with the two limiting forms being valid for n « (h/mc)~* (NR) and n > 
(h/mc)~* (ER) respectively. In the first case with 





PENE: 
n & (= -) x= 10°! cm™?; p= mpn < 10” gm cm~’, (3.14) 


the system is non relativistic; it will also be degenerate if (e7/kgT) >> 1 and 
classical otherwise. The transition occurs at ep ~ kgT which corresponds 
to 


3/2 
nT~3/? = eer = 3.6 x 1076 (3.15) 
in c.g.s. units. To see what these numbers mean, let us estimate the ratio 
R = (kpT/er) x (mkgT/h?n?/3) for some astrophysical systems. The 
solar wind (n ~ 10 cm~°,T œ 10° K) has R ~ 10% which makes it totally 
classical; at the center of the Sun (n ~ 1076 cm~?,T ~ 107 K) we get 
R ~ 5. This is uncomfortably close to unity and one should realize that 
quantum effects are not completely ignorable at the center of the Sun. At 
the center of a typical white dwarf (n ~ 103° cm~3,T ~ 10° K) we have 


R ~ 10-3 making it totally quantum mechanical. 
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In the second case in Eq. (3.12) (with n > (i/mec)~? œ 10°! cm~’; p = 
Mpn > 10” gm cm~%), electrons have pr >> mec and are relativistic irre- 
spective of the temperature. The quantum effects will dominate thermal 
effects if kgT < (he)n'/3 and we will have a relativistic, degenerate gas. 
Figure 3.1 summarizes the different regions in the n — T plane. 

In general, the kinetic energy of the particle will have contributions from 
both the temperature as well as Fermi energy and one needs to use the full 
Fermi-Dirac distribution. But if we are only interested in the asymptotic 
limits, one can approximate the total kinetic energy per particle to be 
eer(n)+kgT. Note that such a system has a minimum energy Ner(n) 
even at T = 0. 
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Fig. 3.1 Regions with different equations of state in the n — T plane. 


Let us now get back to the pressure exerted by such a gas of electrons. 
Using our general result P ~ ne in Eq. (3.3), we can obtain the equation 
of state for the different cases discussed above. First, if the system is 
classical with kpT >> ep, so that € = kgT then P ~ nkgT in both NR 
and ER limits. This is just the ideal classical gas of particles or, say, 
photons. Second, for a quantum mechanical gas of fermionic particles with 
kBT < ep, and € %5 ef, it follows from Eq. (3.12) that P ~ nep. This 
pressure varies as the (5/3) rd power of density in the non-relativistic case 
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and as (4/3) rd power of density in the relativistic case. [Remember that 
whether the system is relativistic or not is decided by the ratio (pr/mc) 
or — equivalently — the ratio (e/mc*) with the transition occurring at 
n = (ħ/mc)~?.] Including all the proportionality factors, we get, in the non 
relativistic limit: 


(3n2)2/3 h p 5/3 
Pyr = =—— — | — = ànR p (3.16) 
5 Me \ Mp 
and in the relativistic limit: 
tiay p 4/3 
P= 1/3 = 4/3 
R rics yl fie (2) Arp”. (3.17) 


In general, if one writes the pressure in terms of density as P œ p7, then 
y is (5/3) in the non relativistic limit (pp « mc) all the way close to 
about pr © 0.6mc; it decreases rapidly to y = 4/3 in the transition range 
(0.6 — 10)mc and stays close to 4/3 for pr > 10mc. 


3.2 Self-gravitating barotropic fluids 


Let us now consider an important application of the above results. One of 
the common situations in astrophysics corresponds to the case in which the 
matter remains in static equilibrium with the pressure gradient balancing 
the gravitational force. In this case, the conditions of equilibrium become 


VP =-pV¢; V?¢=4nGo (3.18) 


where the second equation assumes that the fluid is self-gravitating; ie., the 
gravitational force is only due to the mass density of the fluid. Combining 
these equations we get V -(VP/p) = —4rGp. For a non rotating body 
in thermal equilibrium, P and p will be spherically symmetric and this 
equation reduces to: 
2 

ae (SF) = —AnGp. (3.19) 
We cannot, of course, integrate this equation until another relation between 
P and p is given. Usually, such a relation will involve the temperature 
thereby requiring further equations to close the systems. But there is one 
special situation of considerable importance in astrophysics in which pres- 
sure can be expressed purely as a function of density like in the two cases in 
Eq. (3.16) or Eq. (3.17). (Such fluids are called barotropic.) If the system 
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is barotropic, we will have a relation P = P(p) and the above equation can 
be immediately integrated. 

One simple choice, which has applications in several astrophysical con- 
texts, is given by the polytropic equation of state for which 


P = Kp*)/" = Kp (3.20) 


where K and y = (n+1)/n are constants. In this case, it is possible to use 
scaling arguments to obtain several important conclusions regarding the 
equilibrium configuration. 

To do this, let us transform the variables in Eq. (3.19) to an independent 
variable € and a dependent variable @ by the relations p = p,6",r = a 
where pe = p(0) is the central density and the constant a is chosen to be 





1/2 
n+1 
= Kote : 3.21 
a kee Pè (3.21) 


Straightforward algebra now converts Eq. (3.19) to a differential equation 
for 0(£): 


1 d (,.d0 n 

za (CE) =~ eo 
called the Lane-Emden equation. We need two boundary conditions for 
integrating this equation which are easy to determine: From the definition, 
it is clear that we need 0 = 1 at € = O in order to produce the correct 
central density. Further, since the pressure gradient (dP/dr) has to vanish 
at the origin, it is necessary that (d6/dé) = 0 at € = 0. Given these two 
boundary conditions at the origin, Lane-Emden equation can be integrated 
to find 6(€). The integration proceeds from the origin till a point € = & at 
which @ vanishes for the first time. At this point, the density and pressure 
vanishes showing that the surface of the physical system has been reached. 
All other physical variables can be expressed in terms of the behavior of 
O(€) near € = &1. 

Analytic solutions to Lane-Emden equations are known for the cases 
with n = 0,1,5 corresponding to y = œ, 2,6/5. Denoting the solution for 
a particular value of n by @,,(€), you can easily verify that 

_ sing 


b=1-£/6; h= T; h= Grea A 


(3.23) 


The solutions for n = 0 and n = 1 vanish at £; = v6 and ĉ& = 7 respec- 
tively. The case n = 0 corresponds to a sphere of constant density and 
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the case n = 1 (corresponding to y = 2) is of some relevance in modeling 
planetary interiors and brown dwarfs (see Exercise 3.2). On the other hand, 
for n = 5, @5(€) does not vanish for any finite value of € and the matter 
distribution extends to infinity. Such a system is not of much relevance in 
describing gaseous spheres like stars but is used — in a different context — 
to model galaxies. 

Using the transformation equation between the original variables and 
6 and £, we can express all other physical parameters in parametric form. 
The radius and mass of the system are given by 


n+lk 1/2 Ln) son 
R= ag = [CE] pog, (3.24) 


dé (n+1)K ere d0 
= 3 epee = AEE (8-n)/2n | _ e22 
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=¢ 
(3.25) 
The mean density = (3M /4r R?) is related to the central density by 
3 d0 
Pp=Pe|-z> ‘ 3.26 
=e alte (228) 
The central pressure and the gravitational binding energy of this sphere are 
-1 
GM? dp? 
P= Ri smn 1) (2). ; (3.27) 
zü 3. _ 3 GM? 


Eliminating p; between Eq. (3.24) and Eq. (3.25) we can find the mass- 
radius relationship for these systems: 


M x RE 0-0, (3.29) 


Relation like Eq. (3.29) can also be obtained more easily from scaling 
arguments. In this particular case, for example, equating VP ~ (P/R) to 
(GM p/ R?) and using P œ p!!+@/™)), we get pt/” « (M/R). Further, using 
p x (M/R3), one obtains the scaling M «x R-7)/0-»), This relation is 
independent of the detailed nature of the solution and reflects the fact that 
Lane-Emden equation allows self similar solutions. Note that, for n < 1, the 
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M(R) is an increasing function while for 1 < n < 3, the mass M decreases 
with increasing R. 
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Fig. 3.2 The variation of the radius and mass of Lane-Emden spheres of different indices 
parameterized by the central density pec. 


There are two cases of astrophysical significance in which the pressure is 
provided by degenerate fermions. This situation arises in white dwarfs and 
neutron stars at the end of stellar evolution. The equation of state is then 
given by Eq. (3.16) or Eq. (3.17) depending on whether the system is non 
relativistic or relativistic. In the first case, y = 5/3 and n = (y—-1)~! = 3/2. 
Then Eq. (3.29) gives M x R7? with more massive systems having smaller 
radius. On the other hand, for 7 = 4/3, we have n = 3 and the scaling 
in Eq. (3.29) shows that M is independent of R; this value represents a 
transition point. We will see in chapter 5 that the limiting value of M 
plays an important role in the study of neutron stars. 

When the Lane-Emden index n tends to infinity, the polytropic index 
goes to unity and the equation of state becomes P = Kp. Such a system 
is called an isothermal sphere (since P = pkgT with a constant T leads 
to this relation). Since polytropic solutions become difficult to handle as 
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n — œ, it is convenient to approach the isothermal sphere in a different 
manner. 

Consider a self gravitating ideal gas with constant temperature T, den- 
sity p(r), pressure P(r) = p(r)kgT and a gravitational potential ¢(r) such 
that the density at any location is given by 


p(r) = pe exp(—B[¢(r) — $(0))) (3.30) 


with pe denoting the central density. This density distribution corresponds 
to a system in Boltzmann distribution at constant temperature. Combining 
Poisson equation with Eq. (3.30), we get a differential equation for the 
gravitational potential: 


V2b = 4nGp-e FIP) FOI, (3.31) 


In practical applications, isothermal sphere arises when the source (gas of 
molecules or ‘gas’ of stars, ....) is described by a distribution function 


fle) = TE exp (-5) ' (3.32) 


where € = (1/2)v?+¢. This distribution is parameterized by two constants 
po and a. One can easily verify that the mean square velocity < v? > is 
30°? and that the density distribution is p(r) = po exp(—¢/c7) with the 
central density pe = po exp(—4(0)/o7). It is conventional to define a core 
radius ro and a set of dimensionless variables by 


2 \1/2 
ro= (Ze ) Sawer, ee (3.33) 
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Then the Poisson equation V2¢ = 4rGp can be rewritten in the form 


1d (Pa _ 
ai (=$) = -9€. (3.34) 


This equation has to be integrated numerically with the boundary condition 
¿(0) = 1 and €’(0) = 0, to determine the density profile. Given the solution 
to this equation, all other quantities can be determined. 

Let us consider the nature of solutions to this equation. Remarkably 
enough, there exists a simple exact solution to this equation. By direct 
substitution, you can check that 


Esing (I) = a (3.35) 
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satisfies this equation. This solution, however, is singular at the origin 
and hence is not physically admissible. (It clearly violates the boundary 
conditions at the origin.) The importance of this solution lies in the fact 
that other (physically admissible) solutions tend to this solution for large 
values of r. Thus all the exact solutions have the behaviour p œ r~?, 
M(r) xr at large r; the model has to be cut off at some radius to provide 
a finite mass. The numerical solution to Eq. (3.34) starts at unity near 
l = 0 and behaves like the solution in Eq. (3.35) for large l. For | < 2, the 
numerical solution is well approximated by the function £(1) = (1 +12)~?. 

Very often in astronomy, what is measured is the 2-dimensional density 
distribution obtained by projecting the 3-dimensional density distribution 
on to the plane of the sky. The projected, two dimensional, surface density 
corresponding to this €(1) is S(R) = 2(1+ R?)~!, where R is the projected 
dimensionless radial distance corresponding to J. It follows that the (true) 
central surface density 2pero where pe is the central volume density and ro 
is the core radius. 


3.3 Flows of matter 


The entire subject of fluid dynamics revolves around understanding the 
motion of fluids in different circumstances which is essentially governed by 
a continuity equation and Euler equation. You will find that, curiously 
enough, the simple, tractable, forms of fluid motion are rarely of interest 
in astrophysics! Either the motion is supersonic or it is turbulent or there 
is no motion at all (like in the cases discussed in the last section, in which 
the gravitational force is balanced by the pressure gradient). We shall 
now describe some of non trivial examples of fluid flow which occur in 
astrophysics, beginning with the important example of accretion. 


3.3.1 Spherical accretion 


Accretion of matter plays an important role in several astrophysical sys- 
tems. It also provides a simple example in which a transition from subsonic 
to supersonic flow occurs under appropriate conditions and illustrates some 
peculiar features of supersonic flow compared to subsonic ones. 

In the normal subsonic situations, a fluid flowing through a nozzle of 
variable area will speed up in narrow constrictions and slow down when the 
cross-sectional area is large. In supersonic flow, a fluid will actually speed 


Chapter 3: Matter 107 


up when the area of cross section increases! To see this, let us write down 
the equation governing the one dimensional fluid flow 


BD ai OD yd A a (3.36) 

dt dx p dz p dz 
where cs = (dP/dp)'/? is the speed of sound and we have written P = c2p 
for the equation of state. This gives, dln p = —(v?/c?)dInv. On the other 
hand, from the conservation of mass, we have pvA = constant where A is 
the cross-sectional area. Taking the logarithmic derivatives of this equation 
and using the omelia for dInp, we get dlnv = —(1 — v?/c?2)-!din A. 
This shows that if v? > c?, then decrease in the cross-sectional area leads 
to decrease in flow velocity and vice-versa. You can see that our usual 
intuition is built on subsonic flows and needs to be corrected for supersonic 
phenomena. 

A simple example of supersonic flow occurs in the case matter of accret- 
ing to a central object. Consider a spherical object of mass M and radius 
R embedded inside a gas cloud. The gas flows towards the central mass due 
to gravitational attraction. In a highly idealized situation, we may consider 
the flow to be radial at some steady rate M. Let us further assume that 
the gas is ideal and polytropic with an index y where 1 < y < 5/3. For 
a steady, spherically symmetric flow, mass conservation requires pur? be 
a constant, where v is the radial component of the velocity. We will set 
M = —4nr*pv to be the constant accretion rate. Logarithmic derivatives 
of this equation give: 

Z- = a (ur?) . (3.37) 
In the Euler equation, p(dv/dt) = —V,;P — Vr@ we can write (dv/dt) = 
(Ou/Ot) + v(Ov/Or) = v(ðv/ðr) for a steady, time independent flow. Fur- 
ther, if the force arises from the gravitational field of the central mass then 
—V,¢ = —GM/r?. Therefore, the Euler equation becomes: 

ðv 10P GM 

ve tae tor =O. (3.38) 
Writing (3P/ðr) = (OP/dp) (Op/Or) = c2 (Op/Or), where cs = (OP/dp)!/? 
is the speed of sound and using Eq. (3.37), Eq. (3.38) becomes: 


GM 


2 
OU 658 +2 =0. (3.39) 
z 


OV CS O p 2 
Yar vr? Or WE) 
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This can be rearranged into a more transparent form: 


1 c2\ dv? GM Qc2r 
3 (1-5) Ge ee (1- zit (3:20) 


that allows us to understand several features of accretion. 

To begin with, note that c2 will tend to some constant value at very 
large distances from the central object. (In general, of course, c? depends 
on p and hence on r). Therefore, the factor (1 — 2c?r/GM) will be negative 
definite at large r, making the right hand side of Eq. (3.40) positive at large 
r. In the left hand side, we want (dv?/Or) to be negative because we expect 
the gas to be at rest at large distances and flow with an acceleration as it 
approaches the central body. Hence v? < c? at large r; ie., the flow is 
sub-sonic at large distances. As the gas approaches the central object, the 
factor (1 — 2c?r/GM) will go to zero at some critical radius re. For the 
polytropic equation of state P « p7 we have c? = yP/p = ykgT; then, this 
critical radius is about 


_ GM ty lt 
Te = 32 (re) ~ 7.5 x 10 (x) Mo cm. (3.41) 


This radius is much larger than the size of typical central masses encoun- 
tered in astrophysics around which accretion takes place. Hence we must 
consider the flow at r < re as well. Repeating the above analysis for r < re 
we can conclude that v? > c2. Hence the flow must make a transition from 
sub-sonic to supersonic speeds at r = re. Further, at r = re, we must have 
either v? = c? or (Ov?/dr) = 0 

All the solutions to Eq. (3.40) can now be formally classified based on 
their behaviour near r = re and are shown schematically in Fig. 3.3: (1) 
The first type of solutions has v? = c? at r = re, v? — 0 as r — oo; further, 
v? < c? for r > re and v? > c? for r < re. This is the relevant solution 
that describes accretion which starts out as sub-sonic at large distances 
and becomes supersonic near the compact object. (2) Consider next the 
solutions with v? = c? at r = rev? — 0 as r > 0; v2 > c? for r > r, and 
v? < c2 forr < re. This represents a time reversed situation to spherical 
accretion describing a wind flowing from the compact object. (We shall 
discuss this case briefly in Sec. 4.4 as a model for the solar wind.) The flow is 
sub-sonic near the object and becomes supersonic at large distances. There 
are also solutions with either (3) (@v?/dr) = 0 at r = rejv? < c? for all r 
or with (4) (3v?/ðr) = 0 at r = re; v? > c? for all r. These solutions are 
either sub-sonic or supersonic in the entire range and hence cannot describe 
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Fig. 3.3 Classification of possible fluid flow patterns in the case of spherical accretion. 


accretion onto a compact object of sufficiently small radius. Finally one can 
have solutions with either (5) (3v?/ðr) = oo when v? = c?;r > re always or 
with (6) (Ou?/dr) = co when v? = c?;r < r, always. These two solutions 
are unphysical because they give two possible values for v? at any given r. 
Thus we conclude that only the type 1 solution is relevant to the problem 
of spherical accretion on to a compact object. 

To proceed further, we integrate Eq. (3.38) using (OP/0p) = y (P/p) = 
c? to obtain: 


Hr), A) _ GM 
2 Cra) 


Since we must have v? — 0 asr — oo, the constant must be c? (oo) / (y — 1). 
On the other hand v? (re) = c2 (re) and (GM/re) = 2¢2 (re). Using these 


= constant. (3.42) 
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relations we can relate cs (Te) to cs (00) getting 


1/2 
es (re) = cs (00) |g | (3.43) 


provided 1 < y < (5/3). Since c2 « p’~!, we also get 


Cs (Tc) 


pre) = p (00) SE (3.44) 


2/(y~1) 
Finally, since r?pv is a constant , we can write M = Anr2p (re) Cs (re), using 
which we can relate the accretion rate M to the conditions of the gas at 
infinity. Straight forward algebra gives 


2 (5-37)/2(y—-1) 
5- 3y i 


tes. 2472 P(X) 
M = rG M Ao) 








(3.45) 


Numerically, this corresponds to the accretion rate 


! a [MN?’/_ pœ) ) (en j 
=1.4x 10"g s7! ( —— z 3.46 
a OES (=) (eo tr | is) 
if y = 1.4, say. To complete the solution we find v (r) is terms of c, (r) from 
the conservation of M. This gives 


pn M ow M Paleo 
"= Anr?p(r)  4nr?p (00) ( cs (r) ) 


Substituting this relation into Eq. (3.42) gives an algebraic equation for 
c? (r). Since this equation has fractional exponents it has to be solved 
numerically. Given c2(r), Eq. (3.47) determines u(r) and the relation M = 
r*(r)u(r) fixes p(r). This completes the solution to our problem. We see 
that there is a steady flow of matter to the central object which starts out 
subsonically but makes a transition to supersonic flow at a critical radius. 

The spherically symmetric accretion is idealistic for several reasons; one 
reason being that we ignored the existence of magnetic fields in the central 
regions close to the compact objects. If the infalling matter is a plasma with 
free charged particles (as it is very likely to have) the magnetic fields will 
also exert a force on the accreting matter. If we take the field to be that of a 
dipole with B(r) = B,(R,/r)? where B, is the magnetic field at the surface 
of the central mass of radius R,, then the magnetic pressure (which is equal 
to the energy density in the magnetic field) is Pmag(r) = B? /87 œ r76. The 
infalling matter exerts an inward ram pressure of the order of pv? which 





(3.47) 
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is balanced by the magnetic pressure at some radius r = rm (called Alfven 
radius) at which pu? = B?/87. Using M = 4nr?pv and v? = 2GM/r we 


get 
1\/7 BiR? 1/7 


For a 1Mọ neutron star with L = Lg, (see Eq. (2.38)) B, = 10!? G and 
Rs = 10 km, we find the Alfven radius to be about rm = 1200 km = 120 
Rs. Obviously, one cannot ignore magnetic fields close (r < rm) to the 
compact object. 





3.3.2 Accretion disks 


In several astrophysical contexts, the infalling matter will have non-zero 
angular momentum which implies that the accretion cannot be spherically 
symmetric. The opposite extreme — which is often closer to reality — 
is disk accretion in which the matter forms a, planar, disk like structure 
around the central object and undergoes accretion. Obviously, there must 
be some processes which transfer angular momentum outward from the 
infalling matter, if this has to occur. We shall now describe some basic 
features of such a thin accretion disk in which the particles at any given 
radius move in a nearly Keplerian orbit to a high degree of accuracy. 

Let us consider the radial structure of such a disk in steady state, as- 
suming that some mechanism for angular momentum loss exists in the 
disc. Because the disk is assumed to be thin, the velocity of gas has 
only the radial component vp and azimuthal component vg with uy < ug. 
The amount of mass M crossing any radius R per unit time is given by 
—2nR¥v,(R) = M, where (R) is the surface density of the disk (ob- 
tained by integrating the mass density along the direction normal to the 
disc, say the z-direction). In a Keplerian disc, in which the particles move 
in nearly circular Keplerian orbits at any radius, the angular velocity of ro- 
tation varies as 2(R) = (GM/R*)!/2, so that the specific angular momen- 
tum varies as R?Q. œ R1/? which decreases with decreasing R. Therefore, 
accretion through successive Keplerian orbits towards the central mass is 
possible only if the gas can constantly lose angular momentum due to some 
viscous torque. If we assume that the viscous force per unit length around 
the circumference at any R is given by F = vi(RdQ/dR) where v is some 
unspecified coefficient of kinematic viscosity, then the viscous torque around 
the whole circumference will be G(R) = (2aRF)R = vU2nR3(dQ/dR). 
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The direction of the torque is such that the fluid at radius less than R, 
which is rotating more rapidly, feels a backward torque and loses angu- 
lar momentum while the fluid at radius larger than R gains the angular 
momentum. 

To determine the radial structure of the disk we have to equate this 
torque to the rate of loss of specific angular momentum. Consider an an- 
nular ring of material located between the radii R and R + daR. In unit 
time, a mass M enters the ring at R+dR with specific angular momentum 
(R + dR)*[Q(R + dR)| and leaves at R with specific angular momentum 
R?{Q(R)|. Thus the net angular momentum lost by the fluid per unit time 
in this ring is M[d(R?Q)/dR]dR. This angular momentum is lost because 
of the torque acting at both R and R + dR whose net effect is (dG/dR)dR. 
This leads to the equation 


m daR dR 





. d( R2 
GaL E [vz znr] (3.49) 


Using Q(R) = (GM/R?)"/2 and integrating, we get 


ais RV? £ MR1/2 





+ constant. (3.50) 


The constant of integration needs to be fixed using an appropriate boundary 
condition. One possible way is to assume the existence of an inner radius Ry 
at which the shear vanishes. So the right hand side of the above equation 
should vanish at Rẹ which determines the integration constant. We get 


vE = 4 h = (3) | (3.51) 


The actual value of R, will depend on the specific scenario. For example, if 
the compact object is a black hole, it is natural to take R, to be the radius 
of the smallest stable circular orbit, R, = 6(GM/c?), (as we shall see in 
chapter 5). In the case of accretion on to an unmagnetized white dwarf 
or neutron star, R, can be taken to be the radius of the compact object. 
In the case of a magnetized compact object, the situation is much more 
complicated and R, is determined by the balance between the pressure of 
the magnetic field and ram pressure and will be approximately equal to the 
Alfven radius. 

Given the radial structure of the disc, one can immediately compute the 
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viscous dissipation rate per unit area, given by 


dQ\? 3GMM RAY 





The energy released between R and R + dR is about (3GMM/2R?)dR 
for R > R, which is 3 times larger than the change in the orbital energy 
(GMM /2R?)dR. The excess actually comes from the energy released at 
smaller radii and transported to R by viscous forces. The corresponding 
total luminosity, obtained by integrating D(R), is 


oo MM 
L= D(R)27RdR= z 
R, 2 Rx 





(3.53) 


Note that L is half of the total gravitational potential energy lost in the 
drop from infinite distance to R,. The other half of the potential energy is 
actually present in the form of kinetic energy of the fluid at R, and is not 
available for dissipation as heat. In the case of accretion to a black hole, 
this energy is lost inside the hole while for accretion onto a compact object 
with a surface, this energy is released in a boundary layer or in an accretion 
column. In general, such a dissipation is fairly complex and depends on the 
details. 

A thin disk in steady state radiates energy through its top and bottom 
surfaces. If one assumes that the spectrum is a blackbody with surface 
temperature T,(R), then we must have 20T4(R) = D(R), (the factor 2 
arising from the existence of two sides to the disc), implying 


ayy" 


= 6.8 x 105n71/4 ( 





3GMM 
5) 8rR3o 








1/2 

) Lig R! (x)? K (3.54) 
Leda 
where 7 = (Lg/ M gc?) is the radiative efficiency in the rest mass units 
(expected to be about 0.1), x = (c?R/2GM), R = [1 — (R./R)*/?] and 
Lag = (L/10*° erg s~'). Far from the inner edge, R = 1, T, «x x~9/4 œ 
R-3/4 which is the characteristic dependence of temperature on the ra- 
dius for the thin accretion discs. The overall temperature scale varies as 
(M/M?)"/4 œ (L/Leaa) 2L", 

This analysis applies to a thin accretion disk in which we are ignoring 
the vertical thickness of the gaseous disk. To see what this approximation 
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means, note that the vertical z—component of the acceleration at the height 
z above the plane of the disk at a distance R from the center, is given by 
gz = (GM/R?)(z/R). In pressure equilibrium, we must have dP/dz = 
—pgz. Taking an isothermal equation of state P = pc?, this equation can 
be easily integrated to give p(z) = p(0) exp[—(?z?/2c?)] where Q is the 
angular velocity in the disk. The vertical scale height h = /2c,./Q = 
/2c;R/vg determines whether the disk is sufficiently thin; the condition 
being h/R *cs/vg should be small compared to unity. 


3.3.3 Shock waves and explosions 


Another kind of flow, that occurs frequently in astrophysical systems, is the 
propagation of shock waves. Shocks arise in several contexts in which there 
is a supersonic flow of gas relative to a solid obstacle. In the laboratory 
frame, this could arise either because of an object moving through a fluid 
with supersonic velocities or because gas — flowing with supersonic veloci- 
ties — suddenly encounters an obstacle. To obey the boundary conditions 
near the solid body, the flow has to change from supersonic to sub-sonic 
somewhere near the object. But this information cannot reach upstream 
where the flow is supersonic. Hence the pattern of flow must change very 
rapidly over a short region resulting in a shock wave. In the frame in which 
the gas is at rest, we may consider the solid body to be moving with su- 
personic speed and pushing the gas. This necessarily leads to steepening 
of density profile at some place ahead of the body. We shall now consider 
some of the properties of such shock fronts. 

Let us idealize the surface of discontinuity as a plane with the gas flow 
occurring perpendicular to it with the flow being supersonic on the left 
side of the shock, say. It is convenient to study the situation in a frame 
in which the shock front is at rest. Let P1, p1, u1 denote the state of the 
gas to the left of the shock front and let P2, p2,ug be the corresponding 
values on the right side. The physical quantities on the two sides of the 
shock front are related by the conservation of mass, energy and momentum. 
The conservation of mass gives p1uı = pote = j. The momentum density 
of the fluid changes from (p1u1)ui = p1u? to (p1u1)ue across the shock; 
since this has to be balanced by the discontinuity in the force per unit area 
(i.e., discontinuity in the pressure) acting on the fluid element, we get the 
second conservation law P; + piu? = Pz + p2u%. Finally let us consider the 
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conservation of energy. The energy flux on either side is given by 
1 1 P, 
En = pig (Ge + =) S n=1,2 (3.55) 


where the first term in the bracket is the kinetic energy per unit mass 
and the second term is the internal energy per unit mass (see Eq. (1.53); 
e was internal energy per unit volume so that €/p is internal energy per 
unit mass). When y = 5/3 we get the standard result of (3/2)(P/p) = 
(3/2)kpT. The discontinuity in the energy across the shock must be equal 
to the net rate of work done by the pressure difference, requiring Hy — E1 = 
Piui — P2u2. Simple algebra shows that this condition can be rewritten as 
a conservation law: 


1 P 
Zu? + ——Ż = constant. (3.56) 
2 y-lp 

These three conservation laws — pu = constant, P + pu? = constant 


and Eq. (3.56) — are good enough to determine the three quantities 
(P2/P1),(p2/p1) and (T2/T;) in terms of y and the Mach number Mı = 
u1/Cs on one side where c? = yP/p is the speed of sound in the medium. 
The result of solving the conservation equations, by straightforward (but a 
bit lengthy; try it!) algebra, is 


p2 (1+1) M? u 


p MED w ag 
Pa 1+7(2Mi-1) 
AR omn oe) 
Te _ (oy - DM5 + Al +7 (2M5 -1)) (3.59) 


Tı (y+? M? 


Since we have assumed the flow to be supersonic on side 1 we have Mı > 1; 
it is clear from the above equations that Pz > Pi,p2 > pi,Ue < ur and 
Tz > Tı. The strongest shock which one can have corresponds to the 
limit Mı — oo. In this case P) >> Pı and p2/p1 — (y + 1)/(y — 1). If we 
transform to a frame in which the shock is moving with a speed v2 = u2— u1 
then the above conditions give 


2 +1 2 
v = u; M= je 2 = pj. (3.60) 


(+1) (oT): (y¥+1) 
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A simple example of such a shock wave occurs in the case of an “ex- 
plosion”. An explosion releases large amount of energy E into an ambient 
gas in a time-scale which is small. This will result in the propagation of a 
spherical shock wave centered at the location of the explosion. The result- 
ing flow pattern can be determined everywhere if we make the following 
assumptions: (i) The explosion is idealized as one which releases energy E 
instantaneously at the origin. The shock wave is taken to be so strong that 
the pressure P} behind the shock is far greater than the pressure P, of the 
undisturbed gas. (ii) The original energy of the ambient gas is negligible 
compared to the energy E which it acquires as a result of the explosion. 
(iii) The gas flow is governed by the equations appropriate for an adiabatic 
polytropic gas with index y. 

For such a strong shock, Po >> P, and we can ignore Pı. In the same 
limit, (92/91) = (y +1)/(y-— 1) and hence p2 is completely specified by 
pı. Thus the strong explosion is entirely characterized by the total energy 
E and the density pı of the unperturbed gas. Let the radius of the shock 
front at time t be R(t). Using E, pı and t we can find only one quantity 
with the dimension of length, viz. (Et?/p,)!/5. Hence, we must have 


R(t) = Ro ey". (3.61) 


pı 


where Ro is a constant to be determined by solving the equations of motion. 
All other physical quantities can now be determined in terms of R (t). The 
speed of shock wave with respect to the undisturbed gas is 

u = = = = = = RoB! pr 54-818, (3.62) 
The pressure Po, density p2 and the speed of shock propagation, vg = 
ug — U1, (relative to a fixed coordinate system at the back of the shock) 
can be determined using our junction conditions in Eq. (3.60) across the 
shock front. The density remains constant while v2 and Pz decrease as 
t-3/5 and t78/5 respectively. We will find these results (usually called the 
Sedov solution) are useful in modeling supernova remnants (see chapter 5). 


These scaling laws obtained above can be given a qualitative interpreta- 
tion along the following lines. During the time interval t after the explosion, 
the mass swept up by the fluid will be about pı R(t)? and the fluid velocity 
will be about R(t)/t. Therefore, the kinetic energy of the swept up gas will 
be pı R°t?. On the other hand, the internal energy density will be of the 
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order of p1 R? giving the total internal energy inside the expanding shock 
to be pı R? R3. Equating either of these energies to the total energy E will 
lead to the scaling relation in Eq. (3.61). 


3.3.4 Turbulence 


So far we discussed the flow of fluids ignoring their viscosity. In real life, 
viscosity can introduce some non trivial effects.A dimensionless parameter 
of considerable importance in the study of flow of viscous fluids is the 
Reynolds number R = (UL/v) where U and L are the characteristic velocity 
and length scales of the flow and v = (u/p) is the kinematic viscosity. The 
equations for viscous flow of a fluid can — in principle — be solved when the 
flow is steady, for any given boundary condition, for all values of R. Such 
formal solutions, however, may not actually be realized in nature because 
most of them are unstable. When the steady flow is unstable, any small 
perturbation will grow in time and destroy the original characteristics of 
the flow. 

The mathematical investigation of this — rather generic — instability 
is extremely difficult. It is certain that steady flow is stable for sufficiently 
small Reynolds numbers. Experiments in fluid mechanics suggest that when 
the Reynolds number R increases, we eventually reach a critical value Rerit 
beyond which the flow becomes unstable; it is usual to say that turbulence 
sets in at R = Ret. The value of Rerit depends on the nature of the flow 
as well as the boundary condition and seems to be in the general range 
of (10 — 100). This issue is of tremendous practical importance in, say, 
estimating the drag force which acts on a material body moving through 
the fluid (say, an airplane). For example, the drag force per unit length, 
farag acting on a cylinder of diameter d, moving with speed U through a 
fluid of density p can be expressed in the form farag = Carag(R)(pU7d/2) 
where Carag( P), called drag coefficient, is only a function of the Reynolds 
number. 

Though a highly turbulent flow cannot be analyzed quantitatively from 
the basic equations of fluid mechanics, it is possible to make some progress 
by introducing reasonable physical assumptions regarding the behaviour of 
energy in the turbulent flow. We assume that, in the mean rest frame of 
the fluid, the turbulent velocity is homogeneous and isotropic and can be 
described in terms of coexisting “eddies” of different sizes. Let A be the 
size of a generic eddy and let v) and e) denote the typical change in the 
velocity (across the eddy) and energy per unit mass contained in the eddy. 


118 An Invitation to Astrophysics 


At very large scales, it seems reasonable to assume that viscosity does not 
play any crucial role and the energy which is injected into the system is 
merely transferred to the next level of smaller eddies. The rate at which 
energy can cascade from larger scales to smaller scales is given by 


3 

¿x~ (32) (>) ~ a, (3.63) 
The first factor gives the energy per unit mass in the eddies of scale À 
and the second factor (A/va) t gives the inverse time-scale at which the 
transfer of energy occurs at this scale. (This transfer of energy arises due 
to the nonlinear term (v - V)v in the Euler equation.) In steady state, 
the energy injected into the system at large scale L has be transfered to 
some small scale À,, at which it can be dissipated as viscous heat. For the 
intermediate scales \, < A < L, the energy cannot be accumulated and 
must be transfered at a constant rate é all the way down to the smallest 
scales; it then follows that (vł$/A) is a constant in the intermediate scales 


allowing us to write 
\1/3 
DAU (2) (3.64) 


where U is the velocity at some macroscopically large scale L. Note that the 
largest eddies carry most of the velocity but the smallest ones carry most of 
the rotation 2 = V xv (called vorticity), which is about Q œ va /AÀ « A72/8, 


To estimate the smallest scale As at which this process stops we have 
to calculate the energy dissipation due to viscosity and equate it to é. The 
above analysis shows that v) = (er)! 3. If the shear o of the velocity field 
at some scale A is of the order of (v)/A), viscous heating rate at scale A will 
be 


2 
hy ~ no? n (>) ~ në? 3A74/3 (3.65) 


where 77 is the coefficient of viscosity. At large scales h is negligible com- 
pared to € and hence the same amount of energy will be transmitted from 
scale to scale. As we reach smaller scales, viscous heating becomes impor- 
tant and when h) œ pė, at some scale À = Às, the cascading of energy 
stops. Equating the expressions for h) and pė = p(U3/L) we find 


3/4 3/4 
ee ee wi Oy ese 
we (Gia) lm) =a Goo 
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where R is the Reynolds number evaluated at the largest scale. As long 
as the Reynolds number is large, we have Às < L the range of scales 
As <A < L, over which Eq. (3.64) is valid, is quite large. 

More formally, one can describe the turbulent velocity field in terms of 
the power spectrum or the correlation function. If the velocity field is v(x), 
then the (scalar) correlation function of v can be defined as 


E(x) = (v(x +y) - v(y)) (3.67) 


and the power spectrum S(k) is defined to be the Fourier transform of €(x). 
From Eq. (3.64) we have v(x) ~ é!/3z1/3 so that we expect €(x) to scale as 
E(x) x ¿?/3x?/3. The Fourier transform of €(x) will vary (with k ~ £7!) as 


S(k) x w3€ (ax) œ €7/3 4/3 o é?/3kT 13, (3.68) 


This is known as Kolmogorov’s spectrum. Usually, what is quoted in the 
literature is not the 3-dimensional Fourier transform S(k) but the “one 
dimensional” power, defined as k?S(k) which varies as k~5/3, You will 
often hear that Kolmogorov spectrum has an index 5/3, referring to this 
quantity. 

One application of this spectrum is in the modeling of atmospheric 
turbulence. The atmosphere consists of large cells of gas at very different 
temperatures and turbulence mixes the hot and cold air down to very fine 
scales. The resulting temperature distribution has a Kolmogorov spectrum 
and two parcels of air separated by distance | will have rms temperature 
difference of about ((6T/T)*)!/? ~ 5 x 1075(1/1 cm)!/3. (See Eq. (3.64)) 
which explains the (1/3) scaling). This leads to fluctuations in the refractive 
index of the air. If n is the refractive index of air, then (n — 1) x px TT? 
so that ôn = (n—1)6T/T. Since (n — 1) = 3 x 1074 for air in visible band, 
ôn = 1.5 x 1078(1/1 cm)!/3, 

When a wavefront arriving from a distant source encounters a turbu- 
lent medium, with a randomly varying refractive index, the effective flux 
received by an observer will show spatial and temporal variations. The 
relative phase difference induced on the light ray crossing a blob of air of 
size | will be 6¢ = 2r(l/A)ôn. If the scale height of the atmosphere is 
D = 10 km, then the light will pass through about N = D/I blobs, each of 
which will induce a random phase of about 6¢. The net phase difference is 
Að = VNb¢ = 2nVN(L/A)bn. Using our result for ôn, we get 


AG = 2n(1.5 x 1078) D!/2 15/6 à~! (3.69) 
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when |, D and X are expressed in cm. We will expect coherence to be 
lost when A® œ~ xvx, which occurs at a critical length scale of about 
le = 10(A/5000 Å)6/5 cm. This is the characteristic scale of fluctuations 
which has significant effect on light transmission; note that this scale is 
strongly achromatic with a \°/5 dependence on the wavelength. The angu- 
lar resolution of a ground based telescope, for example, will be A@ = A/R 
where R is the aperture of the telescope, only if R <l,.. When R > le, 
you are not limited by the aperture but by the atmospheric fluctuations, 
so that A@ = A/l.. For À = 4000 A, le = 10 cm, we get A@ = 1” which is 
the intrinsic limitation of ground based telescopes. (This is referred to as 
seeing in astronomy.) 

This random phase A® will tilt the wave front thereby making a plane 
wave surface corrugated. The tilt angle is just A@ = (A®/2n)(\/l) = 
1.5 x 1075171/6 (for D = 10 km with À expressed in cm). The Kolmogorov 
spectrum continues down to about 1 ~ 1 mm with most of the effects 
arising from the fluctuations in the range 1 mm to 5 cm. Because of the 
weak dependence on l, we can assume that the typical angular scale at 
which atmospheric turbulence starts smearing an image is AO ~ 10-5 rad 
= 4 arcsec. Incidentally, the angular size of a star is much less than A@ so 
that it will twinkle due to the atmospheric turbulence; planets, on the other 
hand, subtend angles much larger than A@ so that they do not twinkle. As 
you can see, the explanation for this well known fact is rather non trivial. 


3.4 Basic plasma physics 


In several astrophysical contexts, one needs to deal with matter in the state 
of complete or partial ionisation. When the gaseous system has significant 
amount of ions and free electrons, it is usual to call it a plasma and such 
a medium exhibits a wide variety of electromagnetic phenomena. We shall 
concentrate on some of them in the next few subsections. 


3.4.1 Ionisation equilibrium of plasma 


Neutral hydrogen atom, in its ground state, contains an electron and proton 
bound together with an energy of Eg = —13.6 eV which is equivalent to a 
temperature of Ty ~ 1.6 x 10° K. At temperatures T ~ Ty, fair fraction of 
the atoms will be ionized thereby leading to a composition made of protons, 
electrons and neutral hydrogen atoms. Our first task is to determine this 
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ionisation fraction as a function of density and temperature. 

Let np,Ne and ny denote the number densities of protons, electrons 
and neutral hydrogen atoms (with np = Nne) and let the total number 
density be n = np + ny. The ionisation fraction is defined to be the ratio 
Y = [np/(np + nH)| = (np/n) = (ne/n). We are interested in determining 
y as a function of the temperature T and density n. For a bunch of non 
relativistic particles in thermal equilibrium, the number density n as a 
function of energy E can be expressed as 





3 Qh? 1/2 1 
~ dT —GE); A= f = (kgBTy 3.70 
n = A" exp (—BB) (2) > B= teTy? (37) 
where À (called thermal wavelength) essentially decides the number of phase 
cells per unit volume with à~? œ (p/h)® where p ~ (2mkgT)"/? is the mean 
momentum at temperature T. Applying Eq. (3.70) to the electrons, protons 
and neutral hydrogen atoms involved in the reaction p + e => H, we get 





3 
= (5) eW PEI w \z3e7PEr (3.71) 
where Ey is the ionisation potential for the atom. In arriving at the last 
relation, we have set mp ~ my in the pre-factor. To be precise, the right 
hand side of Eq. (3.70) should be multiplied by the degeneracy factor, gi, 
for each constituent. But ge = 2,9) = 2 and gy = 2 x 2 (due to spins) 
and this factor cancels out in Eq. (3.71). We will comment on this factor 
in detail later on (see Eq. (3.76) below.) 

Using the definition of the ionisation fraction y = (np/n) = (ne/n), 
Eq. (3.71) can be written as 


y? E. Mekal 
Qh? 





l-y n 





3/2 ' 1 
S Eo — 
) e7PlEol — mag “P(-pEr). (3.72) 


This relation is called Saha’s equation. You see that the pre-factor (n\3)~! 
is the ratio between the number of phase cells available in a given volume 
to the number of particles in that volume. Our classical discussion is valid 
only if this factor is large compared to unity. For pure hydrogen, Eq. (3.72) 
becomes 





2 -9 5 
y? _ AOL x 107? 9/2 oxp (= x10 ) (3.73) 


1l-y p T 


where pisin gm cm~® and T is in Kelvin. To understand the temperature 
dependence of ionisation better, let us determine the temperature at which 


122 An Invitation to Astrophysics 


half the atoms are ionized [ie., y = (1/2)] as a function of the density. 
Setting y = 1/2 gives the following curve in the p — T plane, 


ae 24 14 E E 
pilt) = Sr gm m" \ Tex IOK į T 
(3.74) 


which is shown in Fig. 3.4. 





-30 -25 —20 -15 -10 -5 0 


10910 P 


Fig. 3.4 The ionisation structure of pure hydrogen plasma in thermal equilibrium at 
different temperatures. The solid line corresponds to an ionisation fraction of y = 0.9 
and the dashed line corresponds to y = 0.5. 


You see that the temperature changes only by a factor 3 or so when the 
density changes by about 15 orders of magnitude! Thus, for a wide range 
of interesting densities, y = 1/2 corresponds to a temperature of about 
104 K. Also note that significant ionisation (50 per cent in the above case) 
occurs at a temperature which is well below the temperature corresponding 
to the binding energy Eo. This is a direct consequence of the fact that 
(nd3)~1 >> 1 and can be seen from the condition y = 0.5 expressed in the 
form 


2 -1 
kpTos = Er [m Al < Er. (3.75) 
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The temperature at which half the atoms are ionized, T,/2 is of the order 
of kgTy/2 © |Eo|/10 which is a good thumb rule. 

Similar analysis can be performed for more complicated ionisation sce- 
narios. If E; is the energy needed to remove an electron from an atom 
changing its ionisation state from 7 to (¢+ 1), then the corresponding result 


is 
mist _ (ge (Zit (mekeT\*” ox, 
ni -(£)( Zi )( Onhe ) 4 
= (287) (Za) (zr en am) 
P, Zi 20 h2 

where we have introduced the effective degeneracy factors Zi, Zi+1, ge = 2 
and written ne = (Pe/kBT). The Z; is the effective degeneracy factor 
computed by summing gn exp(—ßEn) over all states of the atom. 

In the discussion above we have assumed that there is a well-defined 
notion of temperature which is the same for electrons and ions in a plasma. 
There is some subtlety in this assumption because these particles have 
different inertia. To see this recall that thermal equilibrium is reached by 
energy transfer due to collisions. For a neutral gas of molecules, this is 
essentially determined by molecular collisions with the cross section oo % 
nag ~ 8.5 x 1071" cm? and the corresponding mean free path | = (nao)~!. 
The time scale for establishment of Maxwellian distribution of velocities 
will be about Tneu œ l/v x n7!T71/2, For an ionized classical gas, on 
the other hand, the cross section for scattering is decided by Coulomb 
interaction between charged particles. Since an ionized plasma is made of 
electrons and ions with vastly different inertia, the inter-particle collisions 
can take different time scales to produce thermal equilibrium (i) between 
electrons, (ii) between ions and (iii) between electrons and ions. Each of 
these need to be discussed separately. Consider first the time scale for 
electron-electron scattering. The typical impact parameter between two 
electrons is b ~ (2q?/m,v*) where v is the typical velocity of an electron. 
The corresponding electron-electron scattering cross section is 














2\2 -2 
oe Vee q L ngo T 
Ocoul © TD X T (£) Ja = 10 cm (3 z) (3.77) 
and the mean free path varies as l = (ngcoul)™} œ (T?/n). The mean 
free time between the electron-electron scattering will be Tee œ~ (nav)! 
where n is the number density of electrons and o œ% 7b?. This gives 
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Tee © (m2v*/2mq*n) which is the leading order dependence. A more precise 
analysis changes the numerical coefficient and introduces an extra logarith- 
mic factor just as in the case of gravitational scattering studied in chapter 
1; see Eq. (1.65). So a more precise value is (Tee/L) where L ~ 10. Since 
T x my? x T3/2m'/? at a given temperature T œ (1/2)mv? the ion - 
ion collision time-scale Tpp will be larger by the factor (mp [Me] ~ 43 
giving Tpp = (mp/me)*/ 27.6 ~ 43Tee. Finally, consider the time-scale for 
significant transfer of energy between electrons and ions is still larger be- 
cause of the following fact. When two particles of unequal mass scat- 
ter off each other, there is no energy exchange in the center of mass 
frame. In the case of ions and electrons, the center of mass frame dif- 
fers from the lab frame only by a velocity vom ~ (Me/Mp)? vp < Up. 
Since there is no energy exchange in the center of mass frame, the max- 
imum energy transfer in the lab frame (which occurs for a head-on col- 
lision), is about AE = (1/2)mp(2vcm)? = 2mpvgy ~ 2mev? giving, 
[AE/(1/2)mpv?] ~ (me/mp) < 1. Therefore it takes (mp/me) times more 
collisions to produce equilibrium between electrons and ions. That is, the 
time-scale for electron-ion collision is Tpe = (Mp/Me) Tee ~ 1836Tee. The 
plasma will completely relax to a Maxwellian distribution in this time-scale. 

In a high temperature plasma discussed above, it is the collisions be- 
tween the atoms in the high energy tail of the Maxwell distribution that 
cause the ionisation. The electrons can also be stripped off the atoms in 
different context. This occurs if the matter density is so high and the atoms 
are packed so close to each other that the electrons form a common pool. 
The Fermi energy of these electrons will be comparable to the ionisation 
energy of the atoms: er 2 Er. In this case, the electrons will be quantum 
mechanical and the relevant energy scale for them will be ep. The temper- 
ature does not enter into the picture if kgT < er and we may call this a 
zero temperature plasma. (More conventionally, such systems are, of course, 
called degenerate.) For normal metals in the laboratory the Fermi energy is 
comparable to the atomic binding energy within an order of magnitude. If 
the temperature is below 10* K, the properties of the system are essentially 
governed by Fermi energy. 

You will recall that the derivation of equation of state based on the 
pressure given by Eq. (3.3) assumed that the neutral gas is ideal; i.e., 
the mutual interaction energy of the particles is small compared to the 
kinetic energy. To treat a plasma as ideal, it is necessary that Coulomb 
interaction energy of ions and electrons is negligible. The typical Coulomb 
potential energy between the ions and electrons in the plasma is given by 
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Ecout Œ Zq?n'/3 since the typical inter-particle separation is n~1/3. If 
the classical high temperature plasma is to be treated as an ideal gas, 
this energy should be small compared to the energy scale of the particle 
e ~ kpT, which requires the condition nT~3 « (kg /Zq?)? ~ 2.2 x 1087-73 
{in c.g.s. units) to be satisfied. On the other hand, to treat the high density 
quantum gas as ideal, we should require that the Coulomb energy e€coy! is 
small compared to the non-relativistic Fermi energy er œ (h?/2m)n?/3. 
The condition now becomes n > 8Z°a9° = Z? x 1076 cm~3. You see that 
such a system becomes more ideal at higher densities; this —- somewhat 
counter intuitive — result arises because the Fermi energy rises faster with 
density than the Coulomb interaction energy. 


3.4.2 Electromagnetic fields in plasma 


Since plasma has free charged particles, it has very complex electromagnetic 
properties, most of which play some role or the other in astrophysics. We 
shall discuss a few of these phenomena. 

The simplest non trivial electromagnetic phenomenon that occurs in a 
plasma is called shielding. You know that, in the vacuum, the Coulomb 
field of a charged particle has long range and falls as r~*. But if a charged 
particle is introduced into a plasma, its electric field decays exponentially 
with distance so that one could say the electric field is ‘shielded’ by the 
plasma. This arises because a positively charged particle, say, introduced 
into the plasma will polarize the medium gathering around it a cloud of 
negatively charged particles; the net effect of which is to reduce the effective 
charge as seen from a distance. The shielding would have been perfect at 
zero temperature but the finite temperature of the plasma will not allow the 
negatively charged particles to completely neutralize the positive charge. 
Hence, at non zero temperature we get an exponential decay of the electric 
field with a finite correlation length. f 

This result can be obtained fairly easily. The Poisson equation for the 
plasma, with an extra charge Q added at the origin will be 


V76 = —4rq (n4 — n_) — 40Q6 (r) (3.78) 


where n(x) is the number density of ions (electrons) in the plasma. In 
thermal equilibrium we can assume that n+ œx exp(¢¢/kpT) giving 


n4 - n- = ñ [exp (—q¢/keT) — exp (q¢/kaT)) (3.79) 


where ñ is the uniform density of particles in the absence of ¢. Far 
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away from the charge Q, we will have (qġ/kBT) < 1 and ny —n_ ~ 
~(2ng/kgT)¢d. Hence Eq. (3.78) becomes 


8rg ñ 





2j E 
Vo= kaT o — 4nQ6 (r) (3.80) 
which has the solution 
_Q v2r 
o= a Noe es (3.81) 
where Ap = (kBT/ Amgen)!” is called Debye length. Numerically, 
TNP; n yi 


The corresponding time-scale is given by tp = (Ap/v) where v is the typ- 
ical velocity of the electron. This is the time it takes for the electrons 
to move over the distance Ap neutralising a random fluctuation. Using 
v? œ (kpT/m) we get 


2 kBT m m _2 

tp = (Ez) (=n) = angen Wp (3.83) 
where wp is called the plasma frequency. You will see that it plays an 
important role in the propagation of electromagnetic waves in plasma. 

For a plasma with number density n, the mean inter-particle distance is 
n~1/3 and the typical electrostatic energy of nearest neighbours is q?n!/3. 
The ratio R between this electrostatic energy and the thermal energy kgT 
is 





2,1/3 3 1/3 
Tea lL x NA 
R= kaT x ren! ~ (4) No| x Np. (3.84) 
Hence condition Np > 1 implies that the typical electrostatic energy is 
small compared to kgT which will ensure that the plasma will behave 
collectively as an ideal gas. Numerically, 


TNA n K-12 
weel) (8). e 
p = (nd3,) ~ 360 ( x) —_ (3.85) 
We shall next consider two important electromagnetic wave phenomena 
that occur in a plasma. The first one is called plasma dispersion which 
introduces a frequency dependent velocity of propagation for electromag- 
netic radiation thereby making radiation of different frequencies propagate 
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with different speeds through a plasma. The second effect is called Fara- 
day rotation which involves a rotation of the plane of polarization of an 
electromagnetic wave that is propagating along the direction of a magnetic 
field embedded in the plasma. Both these effects have been used as useful 
diagnostics of astrophysical plasma and magnetic fields. 

Let us begin with the concept of plasma dispersion. Consider an elec- 
tromagnetic field which is propagating through a plasma and assume that 
all the relevant quantities vary in space and time as expi(k-x — wt). In 
free space, k = |k| and w will be related by w = kc and both the phase 
velocity vph = w/k and the group velocity (dw/dk) will be equal to speed 
of light and a constant. The situation changes in the presence of a plasma 
and the new relation between w and k can be determined as follows. The 
Maxwell’s equations in Fourier space has the form 


ik-E =4mrp, ik-B=0, 
ik x E = i(w/c)B, ik x B = (4r /c)j —i(w/c)E (3.86) 


in which all quantities are in the Fourier space. The p, for example, stands 
for p(t, k) which is the spatial Fourier transform of p(t, x) and similarly for 
all other variables. We will assume that, to the first order of approximation, 
most of the current is produced due to the motion of electrons relative to 
ions with the latter treated as stationary. Then the equation of motion for 
the free electron, mv = —qE, has the solution given by v = gE/(iwm). 
(Remember that we are working in Fourier space and — at the end — we 
will have to take the real part of all the variables; a relative i factor merely 
indicates that two physical quantities are out of phase.) This will corre- 
spond to a current j = —nqv = cE where o = ing?/wm. This expresses 
the current in terms of the electric field. Using the charge conservation in 
Fourier space, we have the relation wp = k -j allowing us to express the 
charge density as p = (0 /w)k- E. We can now substitute for p and j in the 
Maxwell’s equation Eq. (3.86) and express them entirely in terms of electric 
and magnetic field as 


ik-eE=0, ik-B=0, 
ik x E = i(w/c)B, ik x B = —i(w/c)eE (3.87) 


where 


€=1-(wp/w)? (3.88) 
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with w, being the plasma frequency introduced earlier in Eq. (3.83); nu- 
merically, 


m 


Arne \ \/? 
wp = (sar) = 56.3 x Vn kHz (3.89) 


if n is in CGS units. By taking the cross product of the third equation in 
Eq. (3.87) with k and using the other equations, it is easy to obtain 


ek =w? — we. (3.90) 


This is the modified dispersion relation for electromagnetic radiation in a 
plasma in the simplest possible context. There are two important conclu- 
sions which arises from this result. 

First, note that k becomes purely imaginary for low frequency radiation 
with w < wp. Suppose an electromagnetic wave with a frequency w < wp 
impinges on a slab of plasma. The wave mode inside the plasma will decay 
exponentially with distance because k is pure imaginary. In other words, 
low frequency radiation cannot penetrate into plasma and will be reflected 
back completely. It is this feature which allows short wave radio broadcasts 
to be reflected back by the ionospheric plasma allowing radio reception 
beyond the line of sight horizon. The characteristic cut-off frequency in the 
ionosphere is about v ~ (10 — 15) MHz. By the same token, one cannot 
receive low frequency radiation in radio band from distant astronomical 
sources — which is the result we mentioned in page 80. 

The physical reason for this phenomenon is easy to obtain. When elec- 
trons are displaced relative to ions because of the action of the external 
electromagnetic field, it will generate a field which opposes the applied 
electric field thereby attempting to reduce the net field to zero. When this 
process is effective, the electromagnetic radiation cannot propagate through 
the region. If we imagine all the electrons in a fictitious area A to be dis- 
placed by a distance dz normal to the area element, then it will generate 
an electric field 420 where o = n,6z is the surface charge density in the 
area element. The equation of motion for an electron in this electric field 
will be m(d?/dt?)éz = —4mrqo = —41n.q?6zx. This leads to oscillations of 
displaced charge densities at the plasma frequency w, showing that this 
is the characteristic time scale of plasma oscillations. On the other hand, 
the characteristic time scale at which electric field varies is w~!. When 
wz! < w7", the electrons can quickly readjust themselves opposing the 
electric field and preventing propagation. This is why, for wp > w the 
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waves are exponentially damped in a plasma. 

Let us now move on to another effect is related to the group velocity of 
those (high frequency) waves which do manage to propagate through the 
plasma. Using Eq. (3.90) for w > wp, we find that the group velocity is now 
given by vg = (ðw/ðk) = c(1 — w2/w?)'/?. If such a group of waves are 
traveling from a cosmic source towards earth, through a plasma, the travel 
time to cover a distance r will be 


"ds "ds we R 
w=-[/2-[2 1-4 , (3.91) 
o % Jo 


For w >> wp, we can use the Taylor expansion of the square root and the 
definition of w? to get 
r  2nq? 


ty = - + — 
C Mew 





(DM), (3.92) 


where the quantity DM, which astronomers call the dispersion measure of 
the plasma, is defined to be 


pM = f Neds. (3.93) 
0 


This quantity gives the column density of electrons along the line of sight. 
Using Eq. (3.92) we can relate the difference in the time of arrival, Arp, of 
a signal at two different frequencies to the dispersion measure. Numerically, 
in the usual units of the astronomer, (n in cm~* and 1 in parsecs), we have: 


DM = Atp Vj =2 V2 —2 =l 
——,—- = 2.4x 1074 (| —= —- — (——— . (3.94 
cm? pe ae ( us ) |G MT) Gi Ta) | eee) 


If the time delay at the two frequencies can be measured then this equa- 
tion allows for the dispersion measure to be determined along any given 
direction. 

As an example, consider a source at a distance 1 kpc from which one 
is receiving radiation that is propagating through a plasma with electron 
density 0.05 cm~3. The dispersion measure will be DM ~ 50 pe cm™è?. 
Radiation at the two different frequencies vı = 1.7 GHz and v2 = 1.5 GHz 
(in the radio band) will show a time delay of about tı — t2 = —0.021 seconds 
which is easily measurable. 

We next consider another phenomenon related to the propagation of 
waves in magnetized plasma (called Faraday rotation). This arises when a 
plane polarized electromagnetic wave propagates along the direction of a 
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magnetic field Bo in a cold plasma. The equation of motion for an electron 
in the plasma is now given by 


mË 


mg = —qE — 4 v x Bo. (3.95) 


Taking the magnetic field to be along the third axis fig and the electromag- 
netic wave to be circularly polarized, we have 


E(t) = Ee~™* (ñ; F if). (3.96) 


From Eq. (3.95) we find that the steady state velocity induced on the elec- 
trons will be 


-iq 
—— E(t .97 
v(t) = APERTI (t) (3.97) 
where wg = qBo/mc is the cyclotron frequency of the non relativistic elec- 
tron. You can now go through the same analysis as before by finding the 
current, charge density etc. and you will find that the plasma behaves as a 
system with two different dielectric constants €g and ez for the right and 
left circularly polarized waves with 
1 wp 3.98) 
eR L = 1 - ——. : 
on w(w +wp) ( 
Any linearly polarized wave can be represented as the sum of two circularly 
polarized waves. Since the speeds of propagation of the two circular polar- 
izations are different, the plane of polarization will be rotated as the wave 
propagates along the magnetic field. For w > wp,w >> wp, the net rotation 
while propagating a distance L along the magnetic field can be expressed 
as 


L 3 L 3 
2re 27e 
= f Ak dz= anna S, Ne Bo dz= m2 (RM) (3.99) 


where the last equality defines the quantity, RM, called rotation measure. 
If the propagation is not strictly along the magnetic field, it is the compo- 
nent of the magnetic field, By, along the direction of propagation, which 
is the relevant quantity in defining the rotation measure. Numerically, in 
conventional astronomical units 


Ay Py ese ae aa Bi Ne A 
= 1 = r fir à 
rad era (a 3 [ Gauss e g (=) (100) 
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As in the case of dispersion measure, observations of Faraday rotation at 
two different frequencies can be used to determine the rotation measure. 
This serves as a diagnostic of astrophysical plasmas. 

Finally, we want to consider the behaviour of magnetic field in a moving 
plasma, and show that — in a highly conducting fluid — the magnetic field 
is “frozen” to the fluid and moves along with it. To do this, consider a 
closed curve C which is ‘flowing along’ with the plasma in the sense that 
every point in the curve is shifted by an amount vdt in space in a small 
time interval dt where v is the velocity of the fluid element at that point. 
In general, the shape of the closed curve will change with time and we are 
interested in the rate of change of magnetic flux 


o= fB -dS (3.101) 
S 


through a surface S bounded by the curve C. This flux will change both 
because of intrinsic time variation of the magnetic field as well as due to the 
fact that the curve C is moving in space. The first effect is easy to compute 
and is given by 


dö 3B 
(Z) - T ds=- f(VxE) dS =— $ B-dl (3.102) 


To obtain the change due to the motion of the loop, notice that the area 
covered by a line element dl, moving with velocity v, in a small time interval 
dt is dA = vdt x dl. This leads to the change of flux through the curve C 
given by 


db =B-dA=B.: (vdt x dl) = (B x v) - dldt. (3.103) 


Adding up all the infinitesimal line elements d1 of the curve and dividing 
by dt, we get the flux change due to the second process to be 


Can z - pw x B) - dl. (3.104) 


Adding the effects in Eq. (3.102) and Eq. (3.104), the net change of flux 
through the loop will be 


dð (dd dd 
2- (F) (F), -f E+vxB) a (3.105) 


But notice that, in the non relativistic limit, v « c, the quantity E’ = 
E + (v x B) is the electric field in the rest frame of the plasma. If the 
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plasma is highly conducting, we must have E’ = 0. (This is because even 
a small electric field will induce a large current if the conductivity is large 
and the current will redistribute the charges making E’ ~ 0.) Hence we 
find that (d®/dt) = 0. That is, the magnetic flux through every closed 
curve in a conducting plasma remains constant as the plasma moves. 

It is also possible to show from this result that any two elements of 
plasma which were on the same magnetic field lines at some instant of 
time, will remain glued to the same magnetic field line at all times. This 
allows one to think of magnetic field lines as frozen into the plasma and 
following the motion of the plasma. 


Exercises 


Exercise 3.1 Getting degenerate: 


(a) How degenerate are you? (b) What about electrons in sodium metal at room 
temperature? 


Exercise 3.2 Jupiter and Saturn: 


The internal structure of planets like Jupiter or Saturn is thought to be described 
reasonably well by an equation of state P = Kp? corresponding to n = 1 poly- 
trope which can be exactly solved. Use the known masses and radii of the planets 
to compute other physical parameters. Is the observed M, R values for Jupiter 
and Saturn consistent with this equation of state? 


Exercise 3.3 One dimensional isothermal profile: 


The one dimensional analogue of an isothermal sphere will satisfy the equation 
(d?6/dz?) = 4rGpo exp[—(¢/c”)]. Show that this equation has the exact solution 
p = po sech? (z/2z0) where z2 = (0? /8rGpo). 


Exercise 3.4 What fixes the accretion rate? 


The result we obtained in Eq. (3.46) shows that the flow rate M is uniquely 
fixed at a value dependent only on M,cs(0o), p(0o) implying a gas can flow to a 
compact object only at a specific rate. The purpose of this exercise is to provide 
an explanation for this result which should have surprised you in the first place. 
(a) Show that when M changes slightly, the velocity and density at any fixed 
radius r changes by the amount 


F 3 2 
ôM _ (: a =) O on (3.106) 
v p c2 v 
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Argue from this result that at any given r, there is a maximum permitted flow 
rate which occurs when v? = c2. (b) Determine the maximum value |Minax(r)| in 
terms of other parameters in the problem. (c) Show that, for 1 < y < 5/3, the 
|Mmax(r)| decreases as A 3)/20—1) for small r, reaches a minimum value Merit 
at r = re and increases as r° for large r. (d) Argue from the above conclusion that, 
in order to have a flow which makes the transition from sub-sonic to supersonic, 
we must have exactly M = Merit. 


Exercise 3.5 Shocking motion: 


Ionized hydrogen, which is getting accreted on to the surface of a white dwarf 
of radius R, is moving through a shock located close to the surface. Show 
that the shock temperature can be written in a suggestive form as kBT % 
mec? (3mp/32me)(2GM/c?R). Assume that the pre-shocked matter is cold. 


Exercise 3.6 Getting it totally wrong: 


(a) At the center of the Sun, T = 1.5 x 10’ K and p = 100 gm cm~*. Estimate 
the ionisation fraction using Saha’s equation. You will find that the center of the 
Sun should be nearly 25 per cent neutral. Do you believe this? (b) Intergalactic 
medium has T = 2.7 K and p = 1077? gm cm~®. It should be almost completely 
neutral while observations suggest that it is highly ionized. What is going on? 
[Hint: (a) Compute the pressure and inter-particle separation at the center of the 
Sun. (b) Remember that Saha’s equation was obtained using the assumption of 
thermodynamic equilibrium] 


Exercise 3.7 Polarization of CMBR: 


Recently observations at 1 GHz has revealed polarization of the cosmic microwave 
background radiation at approximately the expected strength. If the universe 
had a strong magnetic field, the direction of which varies significantly at angular 
scales smaller than the beam of the detector, then this signal would have been 
destroyed by Faraday rotation. This allows us to put a bound on the magnetic 
field strength. Assume the universe was fully ionized with a number density of 
2 x 1078 cm™? and the relevant size of the universe is 1078 cm. Obtain a rough 
upper limit on the net parallel field strength at scales smaller than the beam size. 


Exercise 3.8 Relaxation in real life: 


A gaseous region of ionized hydrogen, usually called a HII region, has an electron 
number density ne = 100 cm7? and kinetic temperature of T = 10‘ K. The 
age of such a region is typically 10” yrs. Estimate the time scale for electrons 
and protons to achieve Maxwellian distribution individually and the time scale 
for the temperatures to be equalized. Assume that the logarithmic factor is 
L = 25. Repeat the same analysis for a supernova remnant with ne = 1 cm~3 
and T = 5 x 10” K and an age of about 400 yrs. [Ans: For the HII region, 
teqe—e © 4X 1078 yrs; teq,i—i œ 1.6 x 1074 yrs; teq,i~e & 7 x 107? yrs. In the case 
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of supernova remnant, teq,e-e % 10” yrs; teq,i-i © 4 x 10° yrs; teqi-e © 2 x 10° 
yrs.| 


Notes and References 


A good text on fluid mechanics is Landau and Lifshitz (1975b). Astrophysical 
fluid dynamics is covered in, for example, Shore (1992). Figure 3.3 is adapted 
from chapter 8, Padmanabhan (2000). 

1. Section 3.2: Detailed discussion of isothermal sphere and its stability can 
be found in Padmanabhan (2000), chapter 10 and in Padmanabhan (1990). 

2. Section 3.8.4: For a more rigorous discussion of fluid turbulence, see for 
example, Landau and Lifshitz (1975b) 


Chapter 4 


Stars and Stellar Evolution 


From this chapter onwards, we shall use the concepts developed in chapters 
1-3 to study different astrophysical systems, starting with the physics of 
stars. Historically, stellar astronomy pioneered the study of astrophysical 
systems and it is no surprise that stars are the astrophysical objects about 
which we have the most comprehensive understanding. To a great extent, 
this is facilitated by the fact that a large sample of stars can be studied 
with reasonable accuracy in our own galaxy. 

As a consequence, stellar physics also determined the terminology and 
jargon (often unfortunate and confusing) which is used in the rest of as- 
trophysics. For example, the division of stars visible to naked eye into a 
set of bins called magnitudes, led to brightness of any celestial object being 
measured by this unit; all elements heavier than hydrogen and helium are 
traditionally called metals by stellar astronomers and this convention (in 
which carbon and nitrogen are termed metals!) still continues; stars are 
classified by a random sequence of alphabets etc. While these traditions 
are unlikely to be replaced by more rational ones in the near future, it 
cannot be denied that stellar physics also set the tone for understanding 
astrophysical systems based on concepts from basic physics. It is this theme 
with which we will begin our exploration. 


4.1 When is gravity important? 


Let us begin by considering the electromagnetic and gravitational binding 
energies of a body of mass M and radius R. The atomic binding energy of 
normal matter is purely electromagnetic and is about (q?/ao) per particle 
where q is the electronic charge and apg is the atomic radius. If the atoms are 
closely packed (with the number density n satisfying the relation na œ 1), 
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the atomic binding energy (per particle) of a system is about €a © (¢?/ao) © 
q’?n'/3, On the other hand, the gravitational energy per particle is 

_ GM? __ Gm5N 

“= "NR Na 





~ GMN?” n3 (4.1) 


where N ~ (M/mp) is the total number of particles in the body, and we 
have again assumed the atoms of mass m, are closely packed with N = na’. 
You can immediately see a crucial difference between gravitational binding 
energy per particle and atomic binding energy per particle. The latter 
€a œx n}/3 depends only on the density and is independent of the total 
number of particles in the system. But in the case of gravitational binding 
energy eg x N?/3n!/3 showing the binding energy per particle itself depends 
on the total number of particles N. The total energy of the system at given 
density varies as Neg « N in the case of atomic binding (which is extensive, 
so that two litres of petrol give you twice as many miles as one litre of petrol) 
while Neg x N 5/3 in the case of gravity — which is clearly non-extensive. 
The ratio of the two binding energies is given by 


Ea a 1 We. 10n 
#~ (2) Ga) = (47) (42) 


where we have defined the ‘gravitational fine structure constant’? ag = 
Gm3/fic ~ 107° (by replacing q? by Gm? in the fine structure constant 
a = q’/hc). The laboratory systems, with N < 1054, have negligible 
gravitational potential energy compared to their atomic binding energy. 
Clearly, the number Ng = a°/ 2a ? = 1054, arising out of fundamental 
constants, sets the smallest scale at which the gravitational binding energy 
becomes as important as electromagnetic binding energy of matter. The 
corresponding mass and length scales are 


Mplanet ~ NeMp ~ 10°° gm , Rplanet = NY’ a0 ~10'° cm, (4.3) 


and are comparable to those of a large planet. For larger masses, grav- 
itational interaction changes the structure significantly while for smaller 
masses gravity is ignorable and matter is homogeneous with constant den- 
sity so that M œ Rè. 

Since we used Newtonian gravity to arrive at this conclusion, it is neces- 
sary to verify that the ratio of the gravitational energy to rest mass energy, 
(€g/M c?) is small for this scale; this ratio for M œ~ Mptanet, R ~ Rplanet 


is about at? a3/ 2(me/Mp) < 1. You see that the smallness of this ratio 
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— allowing planets to be described by Newtonian gravity — follows purely 
from the values of fundamental constants. 

Most of the astrophysically interesting systems have masses larger than 
Mpianet and require the gravitational force to be balanced by forces other 
than normal solid state forces. In general such systems can be classified 
into two categories. The first set has the gravity balanced by the classical 
motion (random or orbital) of the particles; while the second one has the 
gravitational pressure balanced by degeneracy pressure. For a compactly 
packed system with naz ~ 1, the non-relativistic Fermi energy of electrons, 
er & (h?/2m,.)n?/3 ~ (1/2m-)(h/ao), is clearly comparable to the atomic 
binding energy and one can compare either €a or ep with €g to see how the 
system is supported against gravity. (Remember that pressure is propor- 
tional to energy density; so balancing the pressures is the same as balancing 
energy densities.) This is meaningful as long as the temperature of the sys- 
tem (as measured by random velocities) is low and kgT « er œ €a & 10 
eV. If the temperature is significantly higher, then it is the thermal energy 
kgT which should be compared to €g. 

When the mass of the system increases, the gravitational pressure in- 
creases and — to balance it — both the degeneracy pressure and the ther- 
mal pressure will increase. The dynamics of the system will then depend 
on the relative significance of these two quantities. To take into account 
both thermal and quantum degeneracy contributions, we shall assume the 
matter pressure can be approximated by P = nkgT + ner which — while 
not exact — is an adequate interpolation between the two limits. This 
pressure can balance the gravitational pressure if the corresponding energy 
densities are comparable: (kgT + er) ~ Gm? N?/3n'/3_ Using Eq. (3.12) 
for ep for the non-relativistic electrons we get 


(372)? A? 2/3. 


2 Me 


kpT x Gm? N73 p13 — (4.4) 
For a classical system, the first term on the right hand side dominates and 
we see that the gravitational potential energy and kinetic energy (corre- 
sponding to the temperature T) are comparable; this is merely a restate- 
ment of the virial theorem we proved in chapter 1. If the system contracts 
under the self gravity and the radius R of the system is reduced, then the 
density n will increase. But note that the second term (x n?/3) on the right 
hand side of Eq. (4.4) grows faster than the first (x n!/3). So, as the sys- 
tem contracts, the temperature will increase, reach a maximum value Tmax 
and decrease again; equilibrium is possible for any value of the temperature 
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T < Tmax With gravity balanced by thermal and degeneracy pressure. The 
maximum temperature, Tmax, which the body can acquire, is reached when 
n = Ne, with 


a N2/3 a2 
ris aa (a) PTa = ages (Nme) (48 
where Äe = (ħ/mec). An interesting phenomenon arises if the maximum 
temperature Tmax is sufficiently high to trigger nuclear fusion at the center 
of the system; then we obtain a gravitationally bound, self sustained, nu- 
clear reactor usually called a star. Let us consider the condition for this to 
occur. 

For two protons to fuse together undergoing nuclear reaction, it is nec- 
essary that they are brought within the range of attractive nuclear force 
which is about | ~ (h/mpc). Since this requires overcoming the Coloumb 
repulsion, such direct interaction can take place only if the kinetic en- 
ergy of colliding particles is at least of the order of the electrostatic po- 
tential energy at the separation l. This requires energies of the order of 
e ~ (q?/l) = (a/2m)mpc? & 1 MeV. It is, however, possible for nuclear 
reactions to occur at even lower energies by quantum mechanical tun- 
neling through the Coloumb barrier. This happens when the de Broglie 
wavelength Adeg = (h/mpv) = I{c/v) of the two protons (which is clearly 
larger than l) overlap. This occurs when the energy of the protons is about 
uae S (Q?/ 2r? )Mmpe? æ 1 keV. It is conventional to write this expression 
aS Enucl S Na?Mpc? with n ~ 0.1. This quantity €nuci sets the scale for 
triggering nuclear reactions in astrophysical contexts. The energy kgT max 
corresponding to the maximum temperature obtained in Eq. (4.5) will be 
larger than €nuc) when 


3/4 3/2 
N > (2n)3/4(3n?)!/2 (=) (=) 4 x 10% (4.6) 
e 


for 7 ~ 0.1. The corresponding condition on mass is M > M, with 


3/4 3/2 
M, ~ (2n)8/4(3n?)1/2 (=) (=) Mp 4x10 gm (4.7) 
Me ag 

which is comparable to the mass of the smallest stars observed in our uni- 
verse. The mass of the Sun, for example, is Mọ = 2 x 1033 gm. Again, 
you see that the mass of a typical star is completely determined in terms 
of fundamental constants. We shall now discuss how much of the details 
regarding a star can be obtained from basic physical considerations. 
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4.2 Stellar magnitudes and colours 


Since a contracting body will usually have a higher density at the central 
region, nuclear reactions will be first triggered at the central region and 
the heat that is generated will be transported outwards through different 
physical processes. Once the nuclear reactions occur in the hot central 
region of the gas cloud, its structure changes significantly. If the transport 
of this energy to the outer regions is through photon diffusion then the 
opacity of matter will play a vital role in determining the stellar structure. 
In particular, you will see that the opacities determine the relation between 
the luminosity and mass of the star. 

A photon with mean free path 1 = (na)~!, random walking through 
the hot plasma in the interior of the star, will undergo Neon ~ (R/D? 
collisions in traversing the radius R. Therefore, it will take the time 
tese X (INeon/c) = (R/c)(R/l) for the photon to escape. [This is, of course, 
significantly larger than the time taken for collisionless propagation (R/c) 
by a large factor (R/l).] The luminosity of a star L will be proportional to 
the ratio between the radiant energy content of the star, Ey, and tesc. The 
radiation energy content is Ey ~ (aT*) R3 œ T4R? where T is essentially 
the core temperature. Then 

E, RT 


L x — x 


RT‘ 
tesc R? l 


x RT41 «x 
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(4.8) 


For a wide class of stars, we may assume that the central temperature is rea- 
sonably constant. This is because the nuclear reactions — which depend 
strongly on T — act as an effective thermostat. When the temperature 
increases, the nuclear reaction rate increases thereby producing more en- 
ergy. Since the system is in local virial equilibrium, Erot = (1/2) Egray and 
when the total energy becomes more positive, the magnitude of the gravi- 
tational energy, |Egrav| x M?/R decreases. This makes R increase which, 
in turn, decreases the density p ~ M/R°%. This decreases the nuclear re- 
action rate thereby acting as a thermostat. Further, virial theorem gives 
kBT ~ (GMm,/R) so that TR x M. If Thomson scattering dominates 
then o = or = |(87/3)(e?/m_c?)?] and we get 

x RT! T4R* M4 


x =— « M3, (4.9) 
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The situation is different if the opacity is provided by the interaction of 
photons with partially ionized atoms rather than by free electron scatter- 
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ing. The cross section for bound-free and bound-bound opacity in thermal 
equilibrium has been obtained in chapter 2, Sec. 2.5, where it was shown 
that 1 « T7/2n-? « T7? R8 M-?. In this case, we have 


L x RT4l œ RITM? x M2R", (4.10) 


Taking (GM/R) % constant so that R « M now gives L x M5. Taken 
along with Eq. (4.9), we expect the luminosity of a star to be related to its 
mass by L x M®% with a ~ 3-5. 

The discussion so far dealt with core temperature which governs the 
nuclear reactions and energy generation. The effective surface temperature 
T, of a star, of course, will be much lower. To determine T,, we note that the 
total blackbody luminosity from the star will be (4r R?)(ac/4)T4 (see the 
discussion after Eq. (2.22)). It is convenient to use this relation to actually 
define the surface temperature T, of the star with a given luminosity L and 
radius R by L = (ac)nR?T4. 

If the radiation from the star is approximated as that of a blackbody, 
then the intensity J, (which, as you will recall, is the energy per unit area 
per unit time per solid angle per frequency) of thermal radiation emitted 
by the star will be 


__ÆE č _ 2h 1 
~ dAdtdQdy ~~" ~ 2 ehv/kaTs — 1" 


We saw in chapter 2 that the quantity vB, (which gives the intensity per 
logarithmic band in frequency) reaches a maximum value around hy = 
4kBT which translates to the fact that a a star with surface temperature 
of 6000 K will have the maximum for vB, at 6000 A. Observationally, it is 
found that stellar surface temperatures vary from 3 x 10° K to about 3 x 104 
K as the mass varies from about 0.1Mo to 60Mọ. The corresponding 
variation in the radius is in the range of (0.8 — 70) x 101° cm while the 
luminosity ranges from (1078 — 10°:7)Lo where Le = 3.8 x 1033 erg s7? is 
the luminosity of the Sun. 

Using this description we can obtain a few more characteristics of a star. 
In principle, one can fit the spectrum of a star to a blackbody spectrum 
(approximately) and thus measure T,. The total energy flux l received from 
the star on earth (called the apparent luminosity) can also be measured 
directly. Since | = L/(47d?) where d is the distance to the star, we can. 
determine the absolute luminosity L of the star if the distance to the star 
is known. Assuming that the distance can be independently measured, we 
will be able to determine L and plot the location of the stars in a two 


È, (4.11) 


Chapter 4: Stars and Stellar Evolution 141 


dimensional (L — T;) plane. From our definition, it follows that T, œ 
LYV4R-1/2 œ £4/4M-/2_ Combining with M œ L'/5, valid when the 
interior is only partially ionized, we get T, «x L!/4L-1/19 q& 23/20, On 
the other hand, if Thomson scattering dominates with L « M3, we get 
T, x L'/12, Thus, if the stars are plotted in a log T, — log L plane, (called 
Hertzsprung-Russel diagram or H-R diagram, for short; see Fig. 4.6) we 
expect them to lie within the lines with slopes (3/20) = 0.15 and (1/12) ~ 
0.08. The observed slope is about 0.13 giving reasonable support to the 
basic ideas outlined above. The diagonal thick line from left top to bottom 
right in Fig. 4.6 shows an accurate numerical fit to the observed main 
sequence band. 

What we described above is a “theoretician’s” H-R diagram which plots 
the luminosity L of the stars against its effective surface temperature T,. 
Observational astronomers, however, like to plot a quantity called visual 
magnitude against the colour of the star measured in terms of the difference 
of the magnitudes in blue and visual bands. You need to understand what 
these terms mean, which we shall now discuss. 

For totally crazy (that is, historical) reasons, optical astronomers mea- 
sure the luminosity in terms of a quantity called magnitude (M) which is 
related logarithmically to the physical luminosity L by 


M = —2.5 log (=) + Mo. (4.12) 
© 


Here M stands for absolute magnitude (not to be confused with the symbol 
for mass) and you will notice that the brighter an object is, the lower is 
its magnitude. The logarithm is taken to base ten. One can define a 
corresponding quantity (called apparent magnitude, m) using the measured 
flux F = (L/4rd?). To take care of the dependence on d, the numbers 
are chosen such that the absolute luminosity of an object is the same as ` 
the apparent magnitude it would have if it is placed at a distance of 10 
parsecs; that is, m— M = 5log(d/10 pc). To complicate the matters 
further, the luminosity is usually measured within a given frequency range 
(usually called U,V, B etc. for ultra-violet, visible, blue ....) and one can 
have blue magnitude mpg, visible magnitude my etc. for the same object. 
The absolute magnitude, determined by integrating over all frequencies, is 
called bolometric magnitude. The relation between magnitudes and more 
sensible units can be expressed in the form 


Fx = Kx10-°4™X erg cm~? s7! (4.13) 
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where X denotes the band (Ultraviolet, Visible, Blue ...). For the standard 
cases you will encounter (called U, B, V, R, I, J, K; the last few being in 
infra-red), the constants are Kx = (2.90, 6.47, 3.38, 3.82, 2.00, 1.26, 0.19) x 
1076. In the case of total integrated flux and bolometric magnitude, the 
corresponding relation is 


F = 10704” x 2.52 x 1075 ergs cm 7s7*, (4.14) 


The primary effect of all these definitions is to scare physicists away from 
astronomy and you just need to overcome this fear to work in this area! 

Incidentally, it turns out (by accident, not by design) that 10704 = 
0.398 which is quite close to e~! = 0.367 so that magnitudes have a corre- 
spondence with e-folding. This allows you to use standard formulas to cal- 
culate magnitude differences: For example, since 10+0-4x0-01 ~ ¢9-1 ~ 1.01 
a magnitude difference of 0.01 actually corresponds to 1 per cent flux dif 
ference. 

These results can be used to estimate some relevant quantities. For 
example, consider the contribution of starlight to the background light in 
the sky in the optical band. A solid angle dQ will intercept a volume 
(1/3)R3dQ of our galaxy if R is the radius of the galaxy. If the number 
density of bright stars with luminosity of L ~ Lo is n ~ 0.1 pe, then the 
flux per steradian is 


Fr SR’ (nLo) nLoR~9x 107° W m~? rad™? (4.15) 


1 oy 1 
4nR? ~ 127 
if R = 10 kpc. Using 1 rad? ~ 4.25 x 101° arcsec”, we get a sky brightness 
due to integrated starlight of about 2.1 x 10716 W m~? arcsec7?. You 
should convince yourself that this works out to about 21 mag arcsec~?. 

For comparison, note that a galaxy consists of about 10! stars and could 
be located at distances ranging from 1 to 4000 Mpc. At 10 Mpc, such a 
galaxy will subtend an angle of about 0 ~ (2R/d) ~ 400” and will have a 
flux of about 3 x 1071} W m7? if the size of the galaxy Rgal ~ 10 kpc. The 
surface brightness of the galaxy will be about 10716 W m~? arcsec~? ~ 21 
mag arcsec”. These galaxies also contribute to the background light in 
the optical band. Repeating the above analysis we did for stars with L = 
10! Lo, R œ 4000 Mpc, n ~ 1 Mpc™3, we get a background of 5 x 1078 
W m`? arcsec”? ~ 24 mag arcsec™?. 

Another quantity of great observational importance, closely related to 
magnitude, is colour. To understand this concept, let us consider some 
source that emits radiation at three wavelengths ;,A2 and A3. The abso- 
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lute luminosities of the source at these wavelengths can only be determined 
if we know the distance to the source. However, one can determine the 
ratios L(à1)/L(à2) and L(A2)/L(A3) without knowing the distance to the 
source. These ratios, of course, contain important information about the 
spectral characteristics of the source. Since the magnitudes are related 
to the luminosities by logarithms, the ratio of luminosities gets translated 
to differences in the magnitudes. For example, the difference (mg — my) 
between the apparent magnitudes in blue and visual bands will measure 
the ratio of the total fluxes received from a source in the wavelengths cor- 
responding to B and V. (Of course, no instrument can measure the flux 
exactly at one wavelength; what is observed is the integral of F(A)W (A) 
over the wavelength where W(,) is a filter window which is nonzero in a 
small wave length band AX around a given wavelength.) This quantity 
(mp — my) is called the B-V colour indez and is often just denoted as B-V. 
More precisely, the U-B and B-V indices are defined through the relations 


ree 
U — B = —2.51 = } — 0.87, 4.16 
ee Gane ate) 
ee) 
B-V = —2.5log | ———— ) + 0.66 4.17 
j (poe aa 


where F, (erg cm? s~! À~!) is the flux density at the specified wavelength 
à and Ady = 680 A, AXg = 980 A, Ady = 890 A denote the width of the 
colour filters. The constants are a matter of convention. Clearly these refer 
to some kind of a ‘colour’ in the sense that if B-V is large and positive, you 
expect the source to be bluer than some other source which is peaked in the 
visible band. Colour indices are of great practical value because they can be 
obtained by just taking photometric data of the source with different filters 
and can still provide spectral information. Getting the actual spectrum 
of the source can be a lot tougher in many cases. Figure 4.1 shows the 
colour-colour plot for a thermal spectrum and for a power law spectrum 
with F, x v~%. The curves are marked by the temperature for the thermal 
spectrum and the index a for the power law spectrum. It is obvious that 
colours alone can provide fair amount of spectral information. 

Let us next consider another important feature of stellar spectrum viz. 
absorption lines. Since the temperature decreases as one goes from the cen- 
ter to the surface of the star, the radiation emitted from the hot interior can 
be absorbed by the cooler gas in the stellar atmosphere producing absorp- 
tion lines characteristic of the local conditions prevailing the outer regions. 
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Fig. 4.1 The colour-colour plot for Planck spectrum at different temperatures and a 
class of power law spectra Fy x v77 parameterized by the index a. 


A given absorption line will be produced by a particular element, in a par- 
ticular ionisation state, when an electron makes the transition between two 
energy levels. The probability for such an absorption by partially ionized 
elements in their excited states will depend sensitively on the temperature. 
At high ternperatures, atoms tend to be completely ionized and hence will 
not exhibit the discrete energy levels corresponding to bound states. At 
low temperatures, on the other hand, there will be very few atoms in the 
excited state which can produce the given absorption line. It follows that 
absorption due to atoms in their excited states will be maximum at some 
intermediate temperature — which can be estimated if the atomic prop- 
erties are known. Further, absorption lines due to different elements will 
peak at different temperatures. Hence by choosing different atomic line 
transitions, one can probe the stellar atmospheres effectively. Such a spec- 
troscopic analysis has led to a classification of stars, usually denoted by 
the sequence of letters O,B,A,F,...., etc. which are roughly in the order of 
decreasing surface temperature. (Again, these letters are not of any par- 
ticular significance and the ordering is of historical origin. One favorite 
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astronomical pastime is to find mnemonics to remember this sequence.) 
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Fig. 4.2 Figure showing why Balmer absorption lines (which are caused by the upward 
electronic transitions from n = 2 state of hydrogen) are seen most prominently in an 
intermediate range of temperature. The dotted line shows the neutral fraction of hydro- 
gen and the dashed line shows the fraction of these neutral atoms which are in n = 2 
state. Their product, shown by the thick unbroken line has a clear maximum. 


To illustrate this phenomena in a simple context, let us consider the 
Balmer absorption line due to hydrogen from near the stellar surface. This 
absorption occurs when an electron in the state n = 2 of the hydrogen atom 
absorbs a photon and makes a transition to any of the higher states. Hence 
the strength of the absorption line will be proportional to the fraction of 
hydrogen atoms which are in the n = 2 state. This fraction, in turn, is the 
product of (i) the fraction of hydrogen atoms which are neutral and (ii) 
the fraction of the neutral hydrogen atoms which are in the n = 2 state. 
To simplify the calculations, let us ignore other excited states and assume 
that the hydrogen atom is either in the n = 2 state or in the n = 1 state. 
The neutral fraction of the hydrogen atoms is M = (1 — y) where y is the 
ionisation fraction given in Eq. (3.73). If we denote the right hand side 
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of Eq. (3.73) by F(T) then it is easy to solve the quadratic equation and 
obtain N as 


N = $ [2+ F) — (F? +4F)"?]. (4.18) 


This quantity is plotted by a dotted curve in Fig. 4.2. As expected, hy- 
drogen is mostly neutral at low temperatures and makes rapid transition 
to ionized state at high temperatures. The relative population of neutral 
atoms in the n = 2 and n = 1 states is given by: 


_ N(n=2) _ M 


= = — 4 —AE/kBT — 4 —10.2eV/keT 4.19 
N(n=1) M ~ E (4.19) 


The factor 4 comes from the fact that the quantum state n has a degeneracy 
gn = 2n? giving a relative weight of 8/2 to the two states. The fraction of 
the total neutral atoms which is in n = 2 state is given by 


LA fo 
No+M™, ~ 14+R 


The dashed curve in Fig. 4.2 shows this fraction as a function of tempera- 
ture. Obviously when the temperature is low, most atoms are in the ground 
state while at high temperatures, some of the atoms can get to n = 2 state. 
The net fraction of atoms which will be contributing to the Balmer line 
absorption is given by the product 


No, M NitNa _ tz) 
Not Ni + Na No LEER ' 


This product is also shown by a thick line in Fig. 4.2. It is clear that there 
is a distinct maxima as a function of temperature due to two competing 
effects. Obviously, stars which show strong balmer absorption will have 
a surface temperature near this maxima while stars with either lower or 
higher surface temperature will not show this signature. 

Figure 4.3 shows the actual spectra of different types of stars (with dif- 
ferent surface temperatures) and the standard classification symbols which 
are used. One can clearly see that Balmer absorption is dominant at some 
intermediate temperature. Similar effects also explain other features — 
which could at first be surprising — in the Fig. 4.3. For example, there 
are stars in which Ca II absorption is more prominent than the hydrogen 
absorption in spite of the fact that hydrogen is about 5 x 10° times more 
abundant than calcium. The point is that at the relevant range of tempera- 
tures, only a fraction 5 x 107° of the hydrogen atoms are at the n = 2 state 


(4.20) 


(4.21) 
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Fig. 4.3 Spectra of different types of stars and the corresponding absorption lines (Fig- 
ure courtesy: R. Gupta.) 


capable of producing Balmer absorption. On the other hand, nearly 99 per 
cent of all the calcium atoms are in the ground state and can produce the 
Ca II K line at wavelength \ = 3933 Å. In other words, there are nearly 400 
times more Ca II ions capable of producing the absorption lines compared 
to the hydrogen atoms capable of producing Balmer lines. Similar features 
explain why you find exotic stuff like TiO in stellar surface at appropriate 
temperatures. 


4.3 Modeling stellar structure 


We will next try to get a feel for the detailed structure of the star. The first 
step will be to obtain the relevant equations which govern the structure of 
the star. To simplify matters, we shall assume that the star is in steady 
state, spherically symmetric and does not have any magnetic field. Then the 
first equation one could write down relates the mass M(r) = M,, contained 
within a sphere of radius r, to the density p(r) by (dM/dr) = 4rr?p. 
The second equation we have is that of hydrostatic equilibrium equating 
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the gravitational force to the pressure gradient: (dP/dr) = —GMp/r?. 
Together, we now have two equations 
1 
A Ge | Sed as (4.22) 
dM, Arr dM, 4rr2p 


relating three functions M (r), P(r), p(r). We can, of course, supplement this 
by an equation of state which will give the pressure in terms of the density 
and temperature: P = P(p,T) but that will bring in the new unknown 
function, T(r). (This is the general situation. Of course, in special case 
of barotropic fluids for which the pressure can be expressed as a function 
of density alone, P = P(p), the system of equations closes. This case 
was discussed in Sec. 3.2.) So we next need an equation to determine the 
temperature distribution T(r) inside the star, which in turn depends on 
how the energy is transported across the star. If the energy transport is 
radiative, we can relate the radiative flux F to the temperature gradient 
using Eq. (2.36): 

dT 3 kp Lr 


dr 4acT3 4rr? 





(radiation). (4.23) 


This result for (dT /dr), of course, is valid only if the entire flux of energy 
F(r) is transfered through radiative processes. In general, convection could 
also provide a means of transferring the energy and we need to determine 
when it is important. 

One simple condition for the absence of convection can be obtained 
as follows: Let us consider a fluid column located in a gravitational field 
in which a blob of matter with specific volume V(P,s) (which is volume 
of unit mass of matter) moves from a height z to a height z + € where 
the ambient pressure is P’. On displacement, its specific volume becomes 
V(P', s) if the motion is adiabatic and s does not change. This motion of the 
blob will be stable and the resulting force will drive it back to the original 
location provided the fluid element is heavier than the fluid it displaces 
in its new position. The specific volume of the latter is V(P’, s’) where 
s’ is the equilibrium entropy at the height z + £. For stability we must 
have V(P’,s’) — V(P’,s) > 0. Writing, s’ — s = €(ds/dz) and using the 
thermodynamic! relation (9V/0s)p = (T/Cp)(OV/OT)p > 0 we get the 
condition for stability to be (ds/dz) > 0. That is, the entropy must increase 
with height for convective stability. 


1This is easy to prove from the definition Cp = T(0s/8T)p and the relation 
(0s/8T) p = (08/0V) p(OV/8T)p. 
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This result can be expressed in terms of T and P by noting that adi- 
abatic evolution (s = constant) implies T « P‘7-)/7 (see Eq. (3.10)). 
Hence, if s = constant, the temperature gradient is (dInT/dln P) = Vaa = 
(y — 1)/7. So, in the case of stellar interior, this condition for convective 
instability can be expressed as V > Vag where 


_ ant, wy aa 
= dinP’ a i 


Using Eq. (4.23) and Eq. (4.22) we can find the gradient in the case of the 
radiative transfer of energy to be 


dinT 3K P Ly 
Vraa (T 5) 16rac (=) (az) 29) 


If the gradient Vraq arising out of radiative transfer is less than Vaq, then 
the fluid is stable against convection and we can use Eq. (4.23). On the 
other hand, if Vraa is greater than Vaa, we have to take into account the 
energy transported due to convection. (Note that Vrag increases with x; 
high opacity tends to cause convective instability.) This is a complicated 
problem and we do not yet have a fundamental theory describing convective 
energy transfer. However, if we assume very efficient condition, we may 
approximate the gradient to be that of the adiabatic value and assume that 
V = Vaa. This gives 


EEE m 


One crucial feature regarding convection is that it is a very fast process 
compared to radiative transfer. The acceleration induced on a blob of 
gas moving under convection is about (GM,/r?)(AT/T) where AT is the 
temperature difference. If we take AT = 1 K, then a blob of gas can travel 
from 0.5Ro to 0.1Rọ in about 3 weeks time which is considerably shorter 
than the time scale for radiative diffusion. 

We now have three differential equations (two in Eq. (4.22) and either 
Eq. (4.23) or Eq. (4.26) depending on whether energy transport is radia- 
tive or convective) and one equation of state relating the five variables, 
P,M,T,L and p. To close the system, we need one more equation which 
relates the luminosity L(r) to the energy production rate e(p,T) per unit 
mass. This is given by 


(4.24) 





dbx 
am e(p,T). (4.27) 
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In the simplest context, e denotes the energy production due to hydrogen 
fusing to form helium. The hydrogen fusion leading to helium can occur 
through either of the two sequences of reactions (called p-p reactions or 
CNO cycle) depending on the temperature. The form of e for these two 
nuclear reactions cycles are given by the fitting functions: 


_ 2.4 x 104pX? o-3:380/T}/? 


Epp(p, T) & 2/3 erg g7! a (4.28) 
9 


= ee | 
ergg os. (4.29) 
Tè” 


ecno(p,T) ~ 


where Ty = (T/10° K) etc.. Stars with M > 1.3Mọ have CNO cycle as the 
dominant reaction channel while stars with M < 1.3Mọ has p-p reactions 
as the dominant channel. Of these, the CNO cycle has significantly higher 
temperature sensitivity, resulting in a larger temperature gradient in the 
core region, which — in turn — can lead to convection. Hence stars with 
M 2 1.3Mo have a convective core while stars with M < 1.3Mo have a 
radiative core. When hydrogen burning occurs in a convective environment 
the ashes of burning will be uniformly distributed over the core region 
by convection in stars with M = 1.3Mo. It is obvious that stars with 
M $1.3Mg will be structurally different from those with M 2 1.3Mo. 
The equations of stellar structure thus require three input functions 
P(p,T),«(p,T) and e(p,T) for their integration. In general, these func- 
tions have fairly complicated form since they incorporate several different 
effects. The pressure, for example, can come from an ideal gas plus radi- 
ation plus even degeneracy pressure for some range of stellar parameters; 
the opacity can come from free electron scattering as well as free-free and 
bound-free absorption. We have already seen in Eq. (4.28), Eq. (4.29) that 
the nuclear energy generation can have a complicated density and temper- 
ature dependence. Given such complexities, it is necessary to integrate the 
relevant equations numerically to obtain the properties of stars. (The inte- 
gration is fairly straightforward; what is difficult is getting the correct input 
physics as well as modeling convention.) The results of such numerical inte- 
gration is reasonably well fitted by the following fitting formula which gives 
the luminosity and the radius of the star in terms of its mass m = M/Mo 
at the start of the main sequence (usually called Zero Age Main Sequence, 
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ZAMS for short) 


1.107m? + 240.7m® ree 
p | 1+2819m™ Em < 1093; 


== (4.30) 


13990m y ` , 
mi + 2151m? + 3908m + 9536’ if m > 1.093; 


0.1148m!:?> + 0.8604m? 5 fme . 

a of oom oO SE 

R. (4.31) 
2.887 1.679 

1968m Ee if m > 1.334. 


(Of course, the right hand sides will not lead to unity for m = 1 and 
they should not; try to figure out why.) These relations as well as the 
resulting surface temperature T, obtained by oT4 = L/(47 R?), are plotted 
in Fig. 4.4. The solid curve refers to the luminosity which is given in the 
left vertical axis; the dashed curve gives the radius and the dot-dash curve 
gives the temperature and these are referred in right hand side vertical axis. 
It is clear that different physical phenomena contribute in different stars 
making these relations fairly complex. 

If you don’t care for such complexities, but just want to get a feel for 
which effects lead to which results, then one can make a fairly drastic 
approximation to obtain the relevant scalings. This requires assuming that 
the three input functions P, x, € are power laws in p and T. It is conventional 
to take this in the form 


= poP*T~’; k= Kop"T~*; €=e9pT”. (4.32) 
p=p 


Such an approximation for p and « will be valid if any single process makes 
the dominant contribution to pressure and opacity. The approximation of 
€ by a power law is valid in limited ranges depending on which nuclear 
reaction dominates the energy production. Both the reactions in Eq. (4.28) 
and Eq. (4.29) — and, in fact, more complicated helium burning, silicon 
burning and oxygen burning reactions — can be approximated by local 
power law € = €9 pT”. For the p-p chain \ = 1,v = 4; for CNO cycle, 
à = 1,v = 15 to 18. Similarly, for reactions involving helium burning, 
A = 2,v = 40 ete. 

If we make the power law approximation in Eq. (4.32) then we expect 
homologous solutions with power law dependence for all the parameters. 
While such a system can be easily written down and solved, the net effect 
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Fig. 4.4 Luminosity, radius and surface temperature of the zero age main sequence 
stars as a function of mass. The solid curve refers to the luminosity which is given in 
the left vertical axis; the dashed curve gives the radius and the dot-dash curve gives the 
temperature and these are referred in right hand side vertical axis. 


is easy to mimic by the following trick: You just replace all derivatives by 
ratios like dP/dr — P/R etc. (It works for all homologous systems; you 
will recall that we used the procedure in the case of Lane-Emden equation; 
see the discussion after Eq. (3.29).) In this case, Eq. (4.22), coupled with 
P «x pT reduces to P x M?/R*,p x M/R3,T x M/R. Substituting these 
into Eq. (4.23) and using L « eM, we get the simple relation ex = M?. 
Using Eq. (4.32) in which € and « are given by power laws and expressing 
p and T in terms of M and R, we will get a simple relation between M 
and R, which can be expressed as R x M”F. Substituting back in the 
previous relations, allows you to express everything else as power laws in 
M: like T x M"T, Lax M",T, x M™#® etc. You can play around with 
different assumptions, leading to different power law indices in Eq. (4.32) 
and compare the results with the more exact fitting formula in Eq. (4.30), 
Eq. (4.31). The key results of such an analysis can be summarized as 
follows. 
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First, for the high mass stars with CNO cycle being the dominant en- 
ergy production mechanism and electron scattering providing the opacity, 
homology relations give ng œ% (0.78 — 0.81),n, = 3 and nyar % 0.12 de- 
pending on how you model pressure. The observed values are closer to 
nr = 0.56,n, x% 3.5 and nyr % 0.17. These are in the same ballpark and 
the difference is essentially due to the convective regions in the stars. 

Second, for low mass stars dominated by p-p chain, we have to con- 
sider both electron scattering opacity and Kramer’s opacity. For Kramer’s 
opacity, homology models give nr % (0.1 — 0.2), nz ~ (5.4 — 5.5), nR © 
(0.23 — 0.24). With electron scattering opacity, we get nr œ (0.43 - 
0.5) ne © 3,nuR =% (0.17 — 0.18). The observed values are closer to 
nr =œ (0.85 ~ 0.96),n_ = 4.8,nHR = 0.13. It is clear that, while the 
broad trends are captured by these models, the details cannot be repro- 
duced. In fact, numerical integration shows (see Fig. 4.4) that the relevant 
parameters of the stars show different power law indices in different ranges 
and cannot be reproduced by simple homology arguments. 

In all the above discussion we did not say anything about the chemical 
composition of the star, which, of course, is an important issue. We will 
see in chapter 6 that, when the universe was just a few minutes old, some 
amount of nucleons combined together to form Helium which amounted to 
about 23 percent by weight. (We will also see that none of the heavier 
elements were produced in significant quantities.) As the universe evolves 
and the first generation of stars (usually called Pop I) form, they will have 
essentially such a composition: X ~ 0.77,Y ~ 0.23, z ~ 0 where X,Y,Z 
denotes the fraction of hydrogen, helium and heavier elements (metals, as 
they are called!) respectively. As these stars evolve, heavier elements will 
be synthesized in them and — as we shall see later — these elements will be 
spewed out into interstellar medium through supernova explosions. Later 
generations of stars (called Pop IT) will have traces (say, Z =œ% 0.02) of 
heavier elements. In several contexts, it is useful to know how the various 
stellar parameters like luminosity, radius etc. depend on the metallicity. 
Similarly, as stellar nuclear reactions produce heavier elements, the mean 
atomic weight u of the stellar material will change and it is also relevant to 
ask how the results depend on p. These are important in determining, for 
example, the track of the star in HR diagram as the evolution proceeds and 
p and Z change. In these two cases, the homology relations can be more 
reliable. 

The mean atomic weight u enters the equations mainly through the 
ideal gas law (and somewhat more weakly through the opacity). Keeping 
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the leading dependence which arises through the ideal gas laws, one can 
easily show that the radius, temperature and the luminosity of a star of 
a given mass scales as R x p™?,T œ u™T,L x p™+. For CNO cycle 
(v = 15 — 18) and electron scattering opacity (n = s = 0) and ideal gas 
equation of state, we get 


mr =4.0, mpr =0.58— 0.63, mp = 0.42 — 0.36. (4.33) 


This shows that the radius and temperature vary rather weakly with the 
mean atomic weight while the luminosity is a strongly increasing function. 
(For p-p chain, with ideal gas equation of state and Kramer’s opacity, the 
dependence is even steeper and we get mz, œ% 8.) For the first case, the 
slope in H-R diagram will be 


jee (1 5 ame ) æ 0.17 (4.34) 
4 mL 

which is higher than the corresponding value nyr ~ 0.12. This implies 

that as the atomic weight increases, say due to the conversion of hydrogen 

into helium, the stars will have higher L but will evolve below the main 

sequence line in the L — Teg plane. 

Since Z is usually a small number, the heavy element abundance does 
not directly influence u; however, the opacity has a dependence on Z since 
the metals contribute significantly to opacity. Using the power law approx- 
imations for u and « given by u ~ 0.5X 7°57 and x x Z(1 + X)pT~3° for 
low mass stars? (with p-p chain reaction as the main source of energy and 
Kramer’s opacity as the transport process), it can be easily shown that 


Rx 2015 x068. F, o Z711 x -5.0, Teff x Z-0:38 -1.6 (4.35) 


approximately. These combine to give L « Z°35X155, The actual Z 
dependence is weaker than the one indicated above, since the electron scat- 
tering opacity is not taken into account; but the relation shows the correct 
trend. Both luminosity and the effective temperature increase with de- 
creasing Z. For a star of given mass but varying Z we get the relation 
Teff x L™?3. In other words, the main sequence for metal poor stars, will 

2The Z in this expression for « corresponds to the mass fraction of metals; so naively 
this expression will give k = 0 for, say, pure hydrogen. Of course, this is not true in 
the rigorous sense; but at the range of temperatures we are interested in, hydrogen will 


be fully ionized and the bound-free opacity indeed arises only from metals. Thus the 
scaling k œ Z is correct as far as its applicability to the stellar interior is concerned. 
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lie below and to the left of the main sequence for metal rich stars in the 
L — Tes 7 plane. 

The scaling relations obtained above refers mostly to the bulk of the star. 
The situation in the outermost region of the star (usually called the stellar 
envelope or stellar atmosphere), is somewhat different. Here the radiative 
processes are still important in determining the passage of energy but there 
is no energy production or significant self gravity. The variation of different 
physical quantities in the stellar envelope can, therefore, be obtained by 
assuming that the stellar envelope is located in the (external) gravitational 
field of the rest of the star. Let the energy flux of radiation through the 
envelope be Fyaq. Using Fraa = —(ac/3Kp)VT* and VP = —gp, where g is 
the local acceleration due to gravity at the envelope due to the rest of the 
star, we have 


gcadT*  gcaT* 
bj eS rs 4. 

a 3k dP 3r P ee 
where we have approximated (dT*/dP) by (T*/P) in the spirit of homology 
relations. Taking g «x (M/R?),« x pT~**, p œ (P/T) and using the fact 
that F œ (L/R?) is a constant in the envelope, we get the relations: 


1/2 1/2 

KOE ea Eo on re 

R? R? P2 L L 

(4.37) 
The radial dependence of temperature follows from the equation (dP/dr) = 
(P/r) = —gp x —(M/r?)(P/T); this gives T œ (1/r) in the envelope and 
fixes the variation of all relevant quantities in the stellar envelope. 

The entire discussion of stellar structure given above was based on the 
star remaining in a steady state. There is, however, one important physical 
phenomena which can take place in stars which are in ‘almost’ steady state. 
This corresponds to oscillations of the star around its equilibrium config- 
uration, leading to important observable effects. While most stars have a 
relatively stable luminosity during the significant part of their life, some 
stars show striking variability in their energy outputs. These intrinsically 
variable stars have their light output changing in a periodic or semi-periodic 
fashion due to internal processes. The simplest of the variations arises due 
to radial pulsations of the stars at some period P determined by the star’s 
internal structure. An intuitive description of this result can be obtained 
as follows. We start with the equation of motion for a shell of mass m at 
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radius r: 


mi = 4rr?P — on : (4.38) 





In equilibrium, at r = reg, we have the condition 4rr2 Pea = (GMm/r?2,). 
Perturbing the equation around this equilibrium value, by taking r = reg + 
ôr, we get 





2 
d*ér 26r =) GMm (=) (4.39) 


E 
mg Teata (2 Paa ae 
which governs the radial displacement of a shell of matter confined by r = 
Teq Originally. Writing (P/P) = y(ôp/p) = —3q(őr/r) (where the last 
equality follows from the mass conservation pr? = constant) we get 
dôr GM 


m 
= —-—— — Alor. : 
ma = [3y — 4] ôr (4.40) 





This equation shows that ôr is oscillatory with a local frequency w? = 
(GM/r3,) (37-4) « p(37—4). The frequency scales as \/p and the stability 
requires y > (4/3). (This is something we noticed earlier; see Eq. (1.57).) 
The period of oscillation is directly related to the mean density p of 
the stars and hence can be connected to the luminosity and surface tem- 
perature. A more rigorous calculation confirms the result that the period 
of oscillation P = (27/w) satisfies the relation P,/p = constant. For a 
given mode, the right hand side depends only on the polytropic index n 
(introduced in chapter 3; Sec. 3.2) and yaq. This relation is called the 
period-luminosity relation A reasonable fit to the observations is given by 


log (=) = —17.1 + 1.49 log (=) + 5.15 log Tor. (4.41) 
Since both the surface temperature Teg and P can be measured without 
knowing the distance to the star, this relation allows us to determine the 
absolute luminosity of a variable star without knowing its distance. By 
measuring the apparent luminosity and comparing it with absolute lumi- 
nosity, we can estimate the distance to the star. In particular, if such a 
variable star is spotted in another galaxy, this technique will provide an es- 
timate of the distance to the galaxy — which is of considerable significance 
in cosmology. This is one of the examples in which detailed modeling of 
a particular class of stars helps in the understanding of issues which are 
important in a wider context. 
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4.4 The Sun as a star 


We shall now illustrate several aspects of the above description of stellar 
structure using the Sun as a prototype main sequence star. Numerical in- 
tegration of the relevant equations (leading to what is usually called stan- 
dard solar model) provides the following information about the Sun. (i) 
The central pressure and central density are P, = 2.4 x 10!” dynes cm~? 
and pe = 154 gm cm~?. (ii) The pressure falls to half its central value at 
0.12Ro; the temperature falls to half its central value at 0.26Ro; the lumi- 
nosity falls to half its value at 0.11Ro and half the hydrogen is consumed 
virtually at the origin. Here Ro = 6.96 x 10!° cm, is the radius of the 
visible disk of the Sun. We shall first try to understand these values. 
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Fig. 4.5 The temperature, pressure and density inside the Sun scaled with respect to 
the central values. (Figure courtesy: H.M.Antia.) 


Since density can be taken to be approximately constant near the center, 
(dP/dr) % —G(4n/3)p2r near the center. Integration gives P(r) ~ Pe — 
(2r /3)Gp2r? leading to P = P,/2 at r172 = (3P./4nGp2)!/? = 0.1Ro. This 
is pretty close to the value of 0.12 Re obtained from numerical integration. 
Next, since Sun is radiative near the core, V = (dln T/dln P) is less than 
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Vaa = (1—77!) = 0.4 (for y = 5/3); thus the temperature must vary more 
shallowly than T œ P®* and at the radius where T has dropped by a factor 
2, the pressure would have dropped at least by 225 = 5.7. In reality, the 
ratio is more like 13 which suggests a scaling law T x P°-?”. This has to be 
expected, since numerical work also shows that the gradient V varies from 
0.34 at center to 0.23 at r = 0.26Ro and to 0.19 at 0.5Rọ. 

The energy generation in the Sun is primarily due to p-p chain and 
the thermonuclear reactions take place in the core region with r < 0.2Ro. 
In the region 0.2Ro <r < 0.7Rọ, the energy transport is radiative while 
in most of the outer regions 0.7Ro < Ro, the energy transport is due to 
convection. (This is seen from the fact that the adiabatic and radiative 
temperature gradients inside the Sun cross over around 0.71 Ro.) The p-p 
reaction rate which generates solar energy, varies as 


Pepp X X?T P x X? PRT? x X? PS (4.42) 


where the last relation follows from the scaling T « P°-?? mentioned above. 
If q = r/(V2ri/2), then the pressure varies near the center as P « (1 — 
q’) leading to an energy generation epp « (1 — q?)*/?. Using this it is 
easy to find the value of q at which half the total luminosity is generated. 
We get q1/2 = 0.52, predicting that half the luminosity should come from 
within 0.074Ro. This is smaller than the true value of 0.1 Rọ obtained from 
numerical integration because we have neglected the variation of X. 

Since half the hydrogen is depleted at the center, more energy generation 
is occurring at larger radii. Since Sun is tg = 4.5 Gyr old and the efficiency 
of pp chain is 0.007, the amount of hydrogen which must have fused is about 
(Loto/0.007c?) = 0.046Mo. The mass within half luminosity radius is 
0.097Mo and thus originally contained 0.097X Mọ = 0.068Mo of hydrogen. 
In this region, (0.046M /2) = 0.023Mo of hydrogen must have been burnt. 
This is a fraction (0.023/0.068) ~ (1/3) of the total mass of hydrogen in 
this region. 

At the center of the Sun, u = 0.829, p = 154 gm cm~? and T = 1.54 x 
10’ K. The gas energy density is €gas = (3/2)(o/ump)kpT 7% 3.6 x 1017 erg 
cm~* while the radiation energy density is crad = aT’ ~ 4.25 x 1014 erg 
cm—*. The ratio between them is about 1.2x 107°. At half the solar radius, 
u = 0.605, p = 1.43 and T = 3.9 x 10° K, giving egas = 1.16 x 1015 erg cm~’ 
and €raq © 1.75 x 10! erg cm~° giving a ratio of 1.5 x 1073. In fact, this 
ratio is nearly constant in the Sun throughout the range 0 < r < 0.8Ro. 
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This constancy implies that 


T —3 
R = (f=) (rx) = constant (4.43) 


giving p x pT œ p*/3; thus, most of the Sun can be described by a n = 3 
polytrope. The gravitational binding energy of a polytrope of index n = 3 
is [see Eq. (3.28)] 


3 GM? 3 (GMo 
Uew =~ { R )=-$( Ro ): (4.44) 


Virial theorem, on the other hand, gives Uxinetic = —Ugray/2. To relate 
thermal energy Uth to Uxinetic, note that 


Urinetic _ (3/2)NkeT _ (3/2)[Cp—Cv] _ 3 
ap ee (4.45) 











Since Sun is mostly ionized particles, y = (5/3) and Uth © Ukinetic. This 
gives the thermal time scale (also called Kelvin-Helmholtz time scale) to 
be tku = (Um/Lo) = 7 x 10% s which is a factor (350 — 700) larger 
than diffusion time scale taim. The reason is quite simple. All the Sun’s 
radiation leaks out at t =% taig but it is replenished by the thermal energy 
of matter. So for all the Sun’s thermal energy to be carried off in radiation, 
it has to leak out by a larger factor of (€gas/€raa) œ% 700. This result is 
a direct consequence of the equation for radiative diffusion (dP,aq/dr) = 
~KpL/(4mer?). Writing 

dPrad Prad dP yas Le, Prad 2€gas 
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dr Pras dr “Paa 3r 








(4.46) 


we find that L = (€rad/€gas) (Amr /3)€gas(c/Kpr?). Noting that tag = 
(«pr*/c) = r*/cX, this equation can be written as tkH © (€gas/€rad)taift- 

For an ideal monotonic gas, the onset of convection takes place when 
(dinT/dln P) > 0.4 {see Eq. (4.24)] which occurs around r + 0.7Rọ for 
the Sun. Also note that very close to the surface, radiative conditions will 
prevail and hence solar atmosphere has to be treated separately. 

One key result of modeling the solar interior is that Sun emits a steady 
flux of neutrinos which should be detectable at Earth. The mean solar 
neutrino flux on Earth can be easily estimated by taking the dominant 
reaction channel. The cumulative effect of a p-p chain is to convert 4 
protons into one helium nucleus, releasing 2 positrons, 2 neutrinos and 
about 28 MeV of energy, which is eventually radiated as photons. Since 


160 An Invitation to Astrophysics 


2 neutrinos come out with the release of 28 MeV of energy in the form of 
radiation, the total number of neutrinos released per second will be about 
2(Lo/28 MeV). Hence the neutrino flux on Earth, at a distance r from the 
Sun, will be about 


z 2Lo = 2x 4x 10%? erg s7! Span ee aa 
4nr?(28 MeV) 4r(1.5 x 1013 cm)? x 28 MeV 
(4.47) 


Several experiments have been set up to measure the flux-of solar neutri- 
nos arriving on Earth. One of these procedures involves allowing the solar 
neutrinos to react with ?’Cl thereby producing an electron and 3” Ar. The 
product 3’ Ar is radioactive with a half life of 35 days. By measuring the 
amount of Argon nuclei which is produced, one can estimate the number of 
neutrino captures per second which took place in the target. (This reaction 
requires a threshold energy of 0.81 MeV for the neutrino and hence is sen- 
sitive only at higher energy neutrinos.) With an absorption cross section of 
about 10744 cm? and an effective flux of about 10° neutrinos cm~? s~1 one 
would expect 10735 reactions per second per target atom. It is convenient 
to introduce a unit called Solar Neutrino Unit, abbreviated as SNU, which 
stands for 10738 captures per second per target atom. From Eq. (4.47) one 
expects a detection at the level of about 10 SNU. A more precise calculation 
gives 7.7+}'2 SNU where the error bar covers uncertainties in nuclear reac- 
tion rates, opacities, model building techniques and is fairly conservative. 
Observations, however, gave a much lower value for the solar neutrino flux, 
around 2.55 + 0.25 SNU. This discrepancy has been originally called the 
solar neutrino problem. 

We now understand that the solution to the solar neutrino problem 
lies in the phenomenon of neutrino oscillations. In the simplest models 
describing electro-weak interactions, it is assumed that there are 3 different 
species of neutrinos, viz. electron neutrino (ve), muon neutrino (v,) and 
tau neutrino (v) all of which are massless. In that case, each neutrino will 
retain its identity as it propagates from Sun to Earth and the solar neutrino 
experiment will essentially measure the flux of electron neutrino ve; the 
experiments are insensitive to the other two types of neutrinos. There is, 
however, fair amount of evidence that the neutrinos are actually massive. 
If that is the case, then one can construct particle physics models in which 
a phenomenon called neutrino oscillations can exist. In this phenomenon, 
the nature of the neutrino changes periodically in time so that the identity 
of the neutrino can oscillate between, say, Ve state and the v, state. It is 
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then possible to arrange the masses of the neutrinos in such a way that the 
deficit of electron neutrinos seen in terrestrial experiments can be explained 
as due to “conversion” of Ve to Vp, say, as the neutrino propagates from the 
Sun to Earth. 

Finally we consider a phenomenon of some terrestrial significance orig- 
inating from the sun, viz. the solar wind. There is a steady flow of hot 
material from the sun, which can be modeled as the time reversed solu- 
tion to accretion (see Sec.3.3.1) viz. a wind flowing from a central object. 
The solar wind can be modeled (to the lowest order of approximation) as 
a spherically symmetric flow of a non dissipative fluid in steady state. In 
this case, the equation of state is best approximated as P = pkgT for 
which Eq. (3.42) is not valid. However, it is easy to integrate the original 
Eq. (3.40) and obtain 


v? v? r  2GM 
z — log ee 4 log a + ra + constant (4.48) 


8 Ss 





where re = (GM/2c?) is the critical radius at which the velocity of flow is 
equal to the isothermal sound speed cs = (kgT)/?, which is now a constant 
independent of r. The constant of integration can be evaluated by using 
V = Ca at r =r, and we get its numerical value to be 3. Substituting back 
and rearranging the terms, we can write the solution as 


z > (4.49) 


where M = (v/cs) is the Mach number. The relevant solution is the one in 
which v is small near the solar radius r = Rọ and increases to supersonic 
values for r >> re. At such large radii, the first and third terms of Eq. (4.49) 
balance each other giving 


v & 2cesy (log(r/re)). (4.50) 
The corresponding pressure and the density falls as 


Pxpa« (4.51) 


1 
r2,/(logr) 


at large distances. On the other hand, close to the solar radius, v < cs and 
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we can ignore the first term on the left hand side of Eq. (4.49) and obtain 
M = 4n R2,p(Ro)v(Ro) = 4m R2,(Ro)esM (Ro) 


~ Anr2p(Ro)cs exp G — ze) ; (4.52) 
To make numerical estimates relevant to solar wind, we note that, near 
the surface of the Sun, p ~ 10715 gm cm~, c, © 1.3 x 107cm s7!, giving 
M = 1.4 x 10!2 gm s7! = 2 x 10-14Mo per year. From the solution we 
can now estimate v and p at the location of earth at r = 1 AU = 1.5 x 10 
cm. We get M ~ 3.8 and v = 500 km s~!. From the conservation of M, it 
follows that the density of particles in solar wind at the location of Earth 
is about p ~ 10773 gm cm~3. These values match reasonably well with the 
observed features of solar wind. 


4.5 Overview of stellar evolution 


So far the discussion concentrated on the time independent, equilibrium, 
structure of stars powered by nuclear reactions. To have a complete pic- 
ture, this needs to be supplemented by several aspects of stellar evolution, 
starting from the formation of the stars and proceeding to its late stages. 
Being a time dependent problem, this is inherently more complicated than 
what we have discussed so far. In this section, we shall provide a simplified 
discussion of the key features of stellar evolution, to give you a flavor of 
what is involved. 


4.5.1 Formation of a main sequence star 


We will see in later chapters that when a galaxy forms in an expanding uni- 
verse, it will be clumpy and contain fair amount of smaller sub-structures. 
Our own galaxy contains large molecular clouds with masses in the range 
(10° — 10°)M, with temperatures (10-100) K and densities (10 — 104) 
cm~3. To form stars by collapsing such gas clouds requires an increase 
in density by a factor of about 1074 and an increase in temperature by a 
factor of about 10°. This is an inherently complex process and many of 
the details are still unclear. We shall provide a simplified discussion of a 
possible scenario. 

Let us analyse the collapse of a gas cloud under self gravity, ignoring 
the effects of rotation and magnetic field and treating it as spherically 
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symmetric. The first question we need to address is an estimate of the 
mass which can collapse under self gravity. The gravitational potential 
energy of a cloud of mass M and radius R is U = —(3/5)GM?/R (see 
Exercise 1.1 of Chapter 1) while the thermal kinetic energy is (3/2)NkpT 
where N = (M/umy) and p is the mean molecular weight. It was shown 
in chapter 1 that such a cloud is unstable to collapse if the magnitude of 
the gravitational potential energy is larger than (about) twice the kinetic 
energy. This gives the condition for collapse to be 


3MkgT Le GM? 
pmuy 5 R’ 





(4.53) 


Eliminating the radius of the cloud in terms of the mean density po = 
(3M/47R3), this criterion can be expressed as M > Mj where 


My = (Seo) ( 3 )" (4.54) 


Gumy Ar po 





is called the Jeans mass. Numerically, one finds that 


. T 3/2 po —1/2 
~ —3/2 
My =1.2x 10 Mo (ox) pe gmc =) H / . (4.55) 


The density p + 107?4 gm cm~? and temperature T ~ 100 K are reasonable 
for clouds in interstellar medium. In such a case, the above result shows 
that only objects with M > 105 Mọ can collapse initially. You see that this 
value is much higher than typical stellar masses. 

When such a large cloud starts collapsing under self gravity, its physical 
parameters will change during the course of the collapse, thereby changing 
the Jeans mass itself. If the Jeans mass decreases as the cloud collapses, 
then subregions inside the original cloud can become locally unstable and 
start collapsing themselves. This process will continue with smaller and 
smaller inhomogeneities undergoing gravitational collapse, as long as the 
Jeans mass continues to decrease. Therefore, it is quite possible that lower 
mass objects, which should act as pregenitors of stars, arise due to such a 
secondary phenomena — namely, fragmentation inside the collapsing cloud. 


The Jeans mass in Eq. (4.55) depends on the temperature and density 
as Mj x T3/2p71/2. For a collapsing cloud, p keeps increasing; whether 
My increases or decreases will depend on the evolution of the tempera- 
ture T. As the cloud collapses, it releases gravitational potential energy 
which can either: (a) increase the internal energy of the material and cause 
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pressure readjustment or, (b) be radiated away if the time scale for cool- 
ing, (tcool), is shorter than the time scale for free fall collapse (tq). If the 
cloud manages to radiate away the gravitational energy which is released, 
then it can contract almost isothermally. Equation (4.54) shows that, for 
isothermal contraction, Mj varies as p~1/?. 
creases and the Jeans mass decreases allowing smaller regions inside the 
gas cloud, containing fragments of smaller mass, to collapse on their own 
gravitationally. Under most of the interstellar conditions, the collapse time 
scale tff œ (Gp)—'/?, is much larger than the cooling time scale of the 
gas. Hence it is reasonable to assume that collapse of the cloud will be 
isothermal in the initial stages of the contraction. 

This process will cease when the condition for isothermality is violated 
and Jeans mass starts increasing. This will occur when, at some stage in the 
evolution, the cloud becomes optically thick to its own radiation. After this 
stage, the collapse will be adiabatic rather than isothermal. For an ideal, 
mono-atomic, gas with Vaa = (2/5), the temperature scales as T « P?/5 œ 
p*/3 during adiabatic compression. Then My œ T3/2p71/2 œ p!/?, which 
means that the Jeans mass will grow with time. Hence the smallest mass 
scale which can form due to fragmentation will correspond to the Jeans 
mass at the moment when the cloud makes a transition from isothermal to 
adiabatic evolution. 

We can estimate this lower limit to the mass as follows. During the free 
fall time of a fragment, tg ~ (Gp)~!/?, the total energy radiated away is of 
the order of E ~ (GM?/R). In order to maintain isothermality, the rate of 
radiation of energy has to be 


As the star contracts, p in- 


E GM2? 1/2 (93/2 y75/2 
Ca = (2) ie (4.56) 


Ax — x — — 
ta R An R5/2 


However, we have seen in chapter 2 that a body in thermal equilibrium at 
temperature T cannot radiate at a rate higher than that of a blackbody at 
the same temperature. Hence the rate of radiation loss by a cloud fragment 
can be written as 


B = (4r R?) (o T*) f (4.57) 


where f is a factor less than unity. For isothermal collapse, B >> A and a 
transition to adiabaticity occurs when B = A. This happens when 


= [25 a 


: a (4.58) 


-| 
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The fragmentation would have reached its limit when the Jeans mass is 
equal to this value: Mz = M. Replacing M by Myj and expressing R in 
terms of the density, we can estimate the numerical value to be 


T1/4 


My = 0.02 Mo Fina, af 


(4.59) 


where T is in Kelvin. For T ~ 10° K and f ~ 0.1, we get M ~ 0.36Mo; this 
result does not change much if the parameters are varied within reasonable 
range, because of the weak dependence on T and f. Thus collapse of a cloud 
can lead to fragments with masses of the order of solar mass or above, but 
not significantly below. 

It must be stressed that the existence of rotation and magnetic field 
— the two factors we have ignored — can have significant effects on the 
collapse of such a protocloud. For example, if a cloud with an initial radius 
of 50 pe and a slow rotation of 1078 rad yr~? collapses to a final radius 
of about 100 AU, its angular velocity will go up to about 100 rad yr7! 
if the angular momentum is conserved. This corresponds to a period of 
a few days and the cloud would literally fly apart. Obviously, the cloud 
must find a way of losing its angular momentum if the collapse scenario 
has to work. Similarly, the interstellar magnetic field of about 1075 G will 
be amplified to more than about 104 G if the collapse proceeds with the 
magnetic flux being frozen during the collapse. Such high fields are not 
observed in the stars and hence there must also exist some mechanism to 
dissipate the magnetic field as the cloud collapses. 

Further evolution of collapsing cloud leading to formation of protostar is 
quite complicated and many of the details are still uncertain. Eventually, 
several of the collapsing fragments reach a sufficiently hot core in which 
nuclear reactions can take place. In general, lower mass cloud fragments 
will be more abundant inside a given interstellar cloud. Hence the number 
of stars that form per unit volume per unit mass interval will be a strong 
function of the mass. Low mass stars are formed in larger numbers and 
last longer, thereby exhibiting greater abundance. Since the process of star 
formation is extremely complex, it is not easy to obtain from first principles 
a distribution function for the number density of stars of different masses 
that is formed in the galaxy. One popular fit to the initial mass function 
(or IMF) which is extensively used by the astronomers is called the Salpeter 
IMF. This is given by £(M) x M~?:*° which is a simple power law. With 
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appropriate normalization, Salpeter IMF becomes: 
&(m)dm = 2 x 1072m? 35dm pe? yr7?; m=(M/Mo) (4.60) 


which gives the birth rate of stars of different masses in the range 0.4 [S 
(M/Mo) < 10 reasonably accurately. (A more detailed fit is given by Scalo 
IMF which has the form €(M) x M~” with three different values of n in 
three different ranges.) We will have occasion to use mass function in the 
later chapters. 


4.5.2 Life history of a star 


The evolution of a star, once it reaches the main sequence, with nuclear 
reactions triggered in the central region, depends essentially on its mass. 
Figure 4.6 gives the tracks of the stars in the L — T, plane as they evolve. 
.The diagonal branch running across (from top left to bottom right) gives 
the ZAMS based on Eqs. (4.30), (4.31). The following discussion highlights 
the key points and will walk you through the stellar evolution. 

To begin with, if M < 0.08Mo, nuclear reactions cannot be sustained in 
the contracting cloud for it to become a star. Such objects end up as planets 
or brown dwarfs, which are configurations in which the electron degeneracy 
pressure is significant and the material is fully convective. (To be precise, 
brown dwarfs have some nuclear reactions in the core but these are not 
the ones we are interested in for stellar structure.) Similarly, systems with 
masses higher than about (60 — 100)Mo are unstable and cannot last for 
significant period of time. Since radiation and thermal energies in a star 
scale as E, x T*R, Ey, x MT, it follows that E/Ey, « T?R3/M œ M? 
(where we have used the result T x M/R). In a sufficiently high mass star, 
the radiation energy will dominate over thermal energy and blow it apart. 
This sets the upper and lower limits for viable stellar mass. 

In the allowed mass range, stellar structure and evolution are charac- 
terized by some key values for the masses. Based on some features in the 
evolution which will be described below, we can divide the stars into low 
mass, intermediate mass and high mass stars. You will also recall that stars 
with M <1.3Mo have a radiative core while stars with M > 1.3Mo have 
a convective core. 

The first — and the longest — phase of stellar evolution occurs when 
stars act as gravitationally bound systems in which nuclear reactions fusing 
hydrogen into helium are taking place in the center. The process of com- 
bining four protons into a helium nuclei, releases about 0.03m,c? of energy, 
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Fig. 4.6 The approximate tracks of the stars, as they evolve, in the L — Ts plane. The 
thick diagonal line from left top to right bottom denotes the stars in the ZAMS. The 
evolution of stars of masses (1 —64)Mọo are shown above and to the right of this diagonal 
line by zig-zag lines. One of them with M = 4Mọ has a few phases of evolution marked 
by the letters a to e discussed in the text. The dashed lines are loci of constant radii at 
R/Ro = 1,10, 80 increasing from the bottom. 


which is about a fraction 0.007 of the original energy 4m,c?. Assuming 
that a fraction e ~ 0.01 of the total rest mass energy of the star can be 
made available for this nuclear reaction, the lifetime of the nuclear burning 
phase of the star will be about 


eM 9 € 
tms = = © 3x 10° yr (=) (4.61) 


for M ~ Mo and assuming the opacity is due to Thomson scattering so that 


Eq. (4.9) is applicable. This defines the key time scale in stellar evolution. 
A more precise fitting formula for this time scale is given by 


( tms Je 2550 + 669m? 5 + m45 M 


E | = Lla = —. 4.62 
108 yr 0.0327m > +0.346m'®? "T Mo a) 


In fact tms represents the longest stretch of time in a star’s life spent 
in steady state nuclear burning. Most of the scaling relations obtained 
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in Sec. 4.3 are applicable to such stars and they will form a band in the 
H-R diagram called the main sequence band. Since any given star spends 
maximum amount of time in the main sequence, it follows that a significant 
fraction of the stars that we observe will be found along the main sequence. 
Stars of different masses, of course, spend widely different amounts of time 
in the main sequence with the high mass stars spending less time. 

When the star is in main sequence, it is essentially converting hydrogen 
into helium in its core and maintains balance between the energy produced 
and energy radiated out. Even in the main sequence, its radius and lumi- 
nosity are varying slightly. For example, the 4Mo star shown in Fig. 4.6 
will move from the ZAMS to the point marked ‘a’ while burning hydrogen 
in the core. Eventually, at t > tms, the hydrogen in the core runs out and 
the stellar evolution moves to the next stage. Since the electrostatic po- 
tential barrier between the helium nuclei is higher, further reactions which 
fuse the helium nuclei cannot take place immediately and the core begins 
to shrink under its own weight, in thermal time scale, and heats up. The 
hydrogen surrounding the core follows suit and will be compressed to high 
enough densities so that nuclear reactions are again ignited in the hydrogen 
shell. This is a somewhat peculiar situation in which the core is collapsing, 
taking the surrounding hydrogen shell along with it, leading to the energy 
production from the shell. The luminosity of the star increases and the 
increased pressure pushes the outer layers of the star further from the cen- 
ter making the star move into a red giant phase. (It is ‘red’ because the 
surface temperature T, œ (L/R?)!/4 decreases with increasing R, making 
the black body peak shift to the redder side.) In massive stars (M = Mo) 
there is a distinct evolutionary track from ‘a’ to ‘b’ which is crossed very 
rapidly. The time scale for this is very short and is well fitted by 

tua 0.543 M 


2 i EE E 4. 
lms m3- 21m+233 "T Mo (263) 





The subscript HG stands for Hertzsprung Gap, as it is called. Since this 
time scale is very short, very few stars can be caught observationally in this 
phase thereby leading to a gap in the HR diagram in this region. Low mass 
stars do not go through this phase of the evolution does not have a distinct 
Hertzsprung gap. 

The next stage of evolution in moderate to high mass stars is usually 
called the giant phase during which the star essentially moves along the 
track marked ‘b-e’ (and similar tracks for other stars). The point ‘b’ is 
called the base of the giant branch and the track ‘b-e’ is called the giant 


Chapter 4: Stars and Stellar Evolution 169 


branch. The luminosity of the star at the base of the giant branch, Log», 
is essentially determined by its mass: 
Log 2.15 m? + 0.22m5 


M 
Lo 1.0+14x 10-2 m2? +5.0x 10-6 mi?” ~ Mo (4:64) 





The radius and the luminosity of the star while on this track is reasonably 
well fitted by 

5 = (0.25 194 + 0.8 19-87) m; m= ie l= = (4.65) 
The collapse of the core to a sufficiently high density will trigger the next set 
of nuclear reactions producing carbon from helium. This ignition usually 
takes place when the luminosity is about L = Lygp + 2000Lo. When the 
helium ignition occurs, the core expands slightly and the envelope contracts 
(which is just the reverse of the original evolution) causing the star to move 
down to ‘c’. (These fine tracks are not visible in the Fig. 4.6 due to lack of 
resolution.) The star now settles to a new equilibrium configuration with a 
helium burning core and a hydrogen burning shell. During this period, the 
luminosity of the stars remain roughly constant (earning them the name 
horizontal branch stars) at 


Lite 
Lo 


22 
=i 


Its radius, however, decreases making the star evolve along the horizontal 
line from ‘c’ to ‘d’. During its evolution in the horizontal branch, the 
stars convert the mass in the helium core (about 0.45 Mọ) into oxygen 
and carbon in almost equal proportion. These reactions typically release 
a fraction 7.2 x 1074 of the rest mass energy. Taking the luminosity of a 
horizontal branch star to be about 50 Lo, we find that the lifetime of a star 
in the horizontal branch is about typ © 7.2x1074x (0.45Moc?/50L@) = 0.1 
Gyr which is considerably shorter than the time spent in the main sequence. 
A more precise fitting function for the time scale for core helium burning 
is given by 





= 0.763 mos 20 +50m-°1; m (4.66) 
© 





(4.67) 


tms 


tHe _ | 0.0886 + 4.748 x 1074m + 1.186 x 107m? (if m > 10) 
(37.58 — 23.5 m + 4.406 m?)~! (if m < 10). 
Eventually, the helium in the core will get exhausted and a similar course 


of events take place again. The star will burn helium to carbon in an inner 
shell around the collapsing inert carbon core and, of course, hydrogen to 
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helium in another outer shell. In a sufficiently high mass star, there could be 
onion-like structure with different nuclear reactions taking place at different 
shells. A high mass star (M > 8Mo) can continue to burn hydrogen (for 
about 7 x 108 yr), helium (5 x 10° yr), carbon (600 yr), neon (1 yr), oxygen 
(0.5 yr) and silicon (1 day) leading to an iron core with the time scale spent 
in each phase being progressively shorter as indicated in the bracket. The 
overall giant phase of evolution lasts for about 0.15 tms. It is conventional 
to split this giant evolution into a red giant phase (before helium ignition) 
and a super giant phase (after core helium has been exhausted). 

The outer regions of the star can expand to very large radii (of sev- 
eral AU) at this stage. The temperature in the star’s envelope can get 
sufficiently low so that solid particles can condense out of vapour phase 
producing “dust”. (This is exactly how you get soot particles up in the 
chimney.) The dust particles will be blown away from the star by radiation 
pressure thereby leading to a stellar wind which can result in a mass loss 
of up to several times 1075 Mọ yr7!. 

The evolution is different for low mass stars which do not get hot enough 
to ignite anything beyond helium. During the later part of evolution, the 
outer regions of the star undergo violent instabilities leading to an eventual 
ejection of significant fraction of material in the form of a planetary nebula. 
The core region will become a white dwarf supported by the degeneracy 
pressure of electrons. To achieve this balance, the Fermi energy er of 
the electrons must be larger than the gravitational potential energy €g S 
Gm2N?/3n}/ [see Eq. (4.1)]. When the particles are non relativistic, €p = 
(h? /2me)(3n?)?/3n?/9 [see Eq. (3.12)] and the condition ep > eg can be 
satisfied (at equality) if 


2 Gm2m 
1/3 _ pile 2/3 
nl = Gaya (Se Nee (4.68) 


Using n = (3N/4r R?) and N = (M/mz,), this reduces to the following 
mass-radius relation: 


RM!” ~ ag Aem} ~ 8.7 x 1079 RoMg”. (4.69) 


A white dwarf with M ~ Mọ will have R ~ 10~*Ro and density p œ 108 po. 
The evolution of low mass stars usually leads to helium or carbon cores to 
survive as white dwarfs. We will discuss the structure of white dwarfs in 
detail in the next chapter. 

The end stages of high mass stars can be more dramatic. At the end, the 
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core collapses catastrophically and one is led to a supernova explosion and 
a neutron star as a remnant. In the late stages of the evolution, about 1.4 
Mo of original hydrogen gets converted into iron before the star explodes 
as a supernova. This reaction effectively releases a fraction 8.5 x 1073 of 
the rest mass energy. If we take the luminosity to be about 10°Zo, the life 
time turns out to be about 0.18 (L/10°Zo)~! Gyr. 

Several high energy processes become important in the core region of 
the star at this stage. For example, neutrino interactions with nuclei and 
cooling of the core through neutrino emission can be significant even at the 
silicon burning phase. Neutrinos interact only very weakly with matter and 
are characterized by a very small scattering cross section ø. For example, 
the reaction 7.(p,n)et (the compact notation A(B,C)D stands for the re- 
action A+B — C+D) has a cross-section o = 10748 cm?(e,,/1 MeV)? while 
the reaction ve(e~,e7)ve has a cross-section o = 10748 cm?(e,/1 MeV). 
The mean free path for neutrinos of energy e, % 1 MeV, corresponding to 
the first reaction is about À = (no)~! = (mp/po) ~ 10! cm pu! in c.g.s. 
units. This is about 10°Ro for typical stars and about (103 — 104)Ryq for 
white dwarfs. Therefore neutrinos can escape freely from stars and white 
dwarfs carrying energy with them. The main drain of energy arises through 
the reactions 


et +(Z,A) > (Z—-1,A) +> (Z,A) +e+0 (4.70) 


in which the kinetic energy of the electron is transfered to a vi pair. These 
reactions are significant at T > 5 x 109 K. At these temperatures, ete~ 
annihilations will also result in vi pair production in 1 out of 20 cases and 
production in the remaining 19 out 20 cases. The energy production per 
volume in this process is about 


4\ 2 
E X (nena) oc(keT) ~ (£z) oc(kpT). (4.71) 
kpT 

Using o ~ 10744 cm?(T/3 x 10° K)? we get this energy production rate to 
be € ~ 10% erg cm~3(T'/3 x 10° K)°. The corresponding cooling time at 
this temperature tcooi ~ (aT*/E) is about 2 hours, showing that this can be 
a significant drain of the energy. For a 20 Mo star, the photon luminosity 
during this phase is about 4.4 x 1038 erg s~! while the neutrino luminosity 

can be as high as 3.1 x 104° erg s71. 
At this stage, the photo disintegration of nuclei due to collisions with en- 
ergetic photons (which are present at high temperatures), as well as inverse 
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beta decay, reduces the pressure support of the core thereby triggering 
a rapid core collapse. The inverse beta decay also converts a significant 
fraction of electrons and protons into neutrons making the core neutron 
rich. As the core radius decreases and the density increases, the neutrinos 
produced by the weak interactions will get trapped inside. An order of 
magnitude estimate for v—trapping, which illustrates the scalings involved, 
can be obtained as follows: When the collapse of the core (mostly made of 
iron) begins, the physical parameters are about Pcore © 10-19 gm cm-3, 
tcollapse © (Gp)~1/? = 0.1 s and ep = 50 MeV (p12/te)'/3. The neutrinos 
get trapped when their diffusive time scale becomes less than the collapse 
time scale. Taking the nuclear composition to be that of a 56Fe core, the 
mean free path is about A, ~ 0.3 x 105077 3 om. If the size of the region 
is R and the number of scatterings are Nscat before the neutrino escapes, 
then the random walk argument gives R ~ NIŽA, and the diffusion time 
scale is 


Av R\? (x R p?” 
Taiff ~ Nscat (=) ~ (=) (> = Wy x< p7 x p. (4.72) 
A more exact calculation (taking into account the spherical geometry by 


solving the diffusion equation in the spherical coordinates) changes the 
formula only by a factor (3/7?) and we get 


3R? (p) 


=Y ~ 0.02 p12 sec. (4.73) 


Tait © 


This becomes larger than the free fall collapse time 


8 Sua : 
teolt = ( zee) = 1.3 x 1073p sec (4.74) 





when the density of matter is higher than ptrap with 
Ptrap 1.5 x 104 gm cm. (4.75) 


At higher densities, neutrinos are trapped inside the star for the dura- 
tion of the collapse. The neutrino density increases very fast and they 
soon become degenerate. Collapse proceeds further until nuclear densities 
Punel © [3 my /47(1.2 fm)?] ~ 10'4 gm cm? are reached. 

When the nuclear forces prevent further contraction of the core, a shock 
wave could be set up from the center to the surface of the star which 
is capable of throwing out the outer mantle of the star in a supernova 
explosion. The total kinetic energy of the outgoing shock is about 10°! erg 
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which is roughly one percent of the energy liberated in neutrinos. When 
the outer material expands to about 1015 cm and becomes optically thin, 
an impressive optical display arises, releasing about 1049 erg of energy in 
photons with a peak luminosity of about 1043 erg s7} ~ 109Lo. Such a 
supernova is one of the brightest phenomena in the universe in the optical 
band. The outgoing gas ploughs through the interstellar medium with ever 
decreasing velocity and eventually will appear as a shell like supernova 
remnant with a typical radius of about 40 parsec. The central remnant 
is a neutron star supported essentially by the degeneracy pressure of the 
neutrons. We shall discuss these remnants in detail in the next chapter. 
The neutrino trapping also has the effect of reducing their luminosity 
significantly. When the star collapses, from some radius R to a radius 
Ryuc at which the density is of the order of nuclear densities, the amount 
of gravitational potential energy released is of the order of (GM?/Rauc) 
since Rnuc < R. Writing pnuc ~ (3M/4rR3,,.), we get Rnuc © 12 km for 
M œ~ Mo. If this energy was released in the form of neutrinos, within a 
collapse time scale, then the luminosity can be easily estimated to be about 


_ (GM?/Rauc) 


Lu, max = 


~ 10°” erg s™!. (4.76) 
teoll 
On the other hand, neutrino trapping leads to the same energy being lib- 
erated over the diffusive time scale with Taig >> tco when p © Pnuc. AS a 
result, the actual neutrino luminosity turns out to be more like 

L, ~ (GM? / Rouc) ~ 10°? erg s7?. (4.77) 

Taiff 

Hence the bulk of the liberated gravitational energy is actually transformed 
internally to other forms of energy like thermal energy, excitation energy of 
nuclear state etc. rather than being lost to the system. This suggests that 
after the neutrino trapping occurs, the collapse proceeds almost adiabati- 
cally in the system. 

Equation (4.69) is still applicable to the remnant of a supernova explo- 
sion with Me replaced by the mass of the neutron, Mn; correspondingly the 
right hand side of Eq. (4.69) is reduced by (An/Xe) = (me/Mn) ~ 107%. 
Such objects — called neutron stars — will have a radius of R ~ 10~°Ro 
and density of p ~ 10!5po if M ~ Mo. For such values (GM/c?R) ~ 1 
and general relativistic effects are beginning to be important. (See chapter 
5 for a more detailed discussion.) 
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The nature and mass of the final remnant produced in the stellar evo- 
lution is broadly correlated with the initial mass of the star. By and large, 
white dwarfs are formed at the end of the evolution of stars with the initial 
mass of (1 — 8)Mgo and neutron stars are formed from the stars with the 
initial mass of (8 — 50)Mo. The formation rate of any stellar remnant can 
be easily estimated if the initial mass function for stars is known. If one 
uses the empirically determined initial mass function given by Eq. (4.60), 
then the birth rate B of stars in the range of masses (M1, M2) in the galaxy 
is given by 


(Mz/Mo) 
—1.35)(Mi/Mo) — +4 
B= Vai f pdm = 0.19m +35 yr (4.78) 
“Jmm ` lanmo) 
where 
Vaise = tr? (2H) = 1.3 x 10}! pc? (4.79) 


is the volume of the disk of the galaxy with vertical height H = 90 pc and 
radius r = 15 kpc which is appropriate for stars more massive than 1 Mo. 


The number density of compact objects formed from these parent stars will 
be 


(M2/Mo) De eee 
n=Tp f anes pdm = 0.018m line iM) PC 3 (4.80) 
where Ty œ 12 x 10° yr is the age of the galaxy. For white dwarfs, taking 
the mass range of parent star to be (1—8)Mo, we get an integrated galactic 
birth rate of 0.18 per year with the number density nwg œ 1.7 x 107? pe™ë. 
Neutron stars, on the other hand, originate from stars with initial mass 
(8—50)Mo and we get an integrated galactic birth rate of 0.01 per year with 
the number density nns ~ 1073 pe~3. The mean distance (n~1/%) between 
the white dwarfs should be about 3.8 pc while that between neutron stars 
should be about 10 pc. 

To determine the mass contributed by these objects, we have to take 
into account the fact that much of the initial mass will be lost during the 
evolutionary process. The white dwarfs and neutron stars have average 


masses of about < M >yaq=0.65Mo and < M >ns= 1.4Mọ so that 
2 _n<M> (4.81) 
Pgal Pgal 


where pgai © 0.14Mo pc™? is the mass density of the galaxy. This gives 


(e/Pgai) = 0.08 for white dwarfs and (p/pgai) = 0.01 for neutron stars. 
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Exercises 


Exercise 4.1 Simplest exercise in this book: 


Astronomers have discovered an unusual square object in the sky which appears 
to be 4 arcsec x 4 arcsec in angular size and has a magnitude of 10. (a) What 
is the surface brightness of this object in magnitude per square arcsec? [Hint: If 
you think it is (10/16) mag arcsec™*, you haven’t figured out the guiles of the 
astronomers!] (b) If the object is moved to a distance which is twice as far, what 
will be its magnitude? (c) What will be its surface brightness now? (d) If Sun is 
a perfect blackbody at T = 5800 K, what is the flux (in W m~?) you will measure 
if you were right on the surface of the Sun? This is the brightness of the surface 
of Sun. (e) From Earth, Sun appears as a disk of diameter 0.5 degree and the 
total luminosity of the Sun is 3.86 x 107° W. What is the flux per square arcsec 
of the Sun’s disk (in W m~? arcsec~*) which we measure for the Sun? This is 
the surface brightness of the Sun. (f) Express the surface brightness of the Sun 
in magnitude per square arcsec. 


Exercise 4.2 Practice with magnitudes: 


In a device which counts photons statistically, there will be a fluctuation around 
the mean number of photons observed. Assume that when you expect to observe 
N photons on the average, the actual value can fluctuate by an amount +V/N. 
(a) How many photons need to be collected if the magnitude of a star has to be 
measured to an accuracy +0.02? How long an exposure will one require with an 
one meter telescope to measure the B magnitude of an Mg = 20 star with the 
same accuracy? (b) Assume that the night sky has the same brightness as that 
produced by one 22.5 B magnitude star per square arc second over a bandwidth of 
940 A. Express the night sky brightness in erg cm~? s7! ster! ÅT}, (c) Consider 
an attempt to measure the magnitude of a Mg = 24 star within an accuracy 
of £0.02 magnitude. Assume that the starlight is spread out over 1 square arc 
second. How many photons are required to be collected to achieve this and what 
should be the exposure time of a 4 meter telescope for this measurement? [Ans: 
(a) N =% 2.9 x 10° photons; about 26 seconds. (b) F = 3.5 x 107° erg cm~? s7! 
ster“! ÅT}. ] 


Exercise 4.3 Scaling relation and molecular weight: 


Use the scaling arguments to determine the dependence of the luminosity of a 
star on the mean molecular weight, if the opacity is given by Kramer’s formula; 
i.e., prove Eqs. (4.35) and related results. Normalize this result using the Sun 
for which X = 0.70, Y = 0.28 and Z = 0.02. Hence determine how different the 
luminosity will be for a star with X = 0.98 and Z = 0.02 and no helium. 
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Exercise 4.4 Stars as polytropes: 


(a) The low mass stars (M < 0.3Mo) can be approximated as a y = 5/3 polytrope. 
One can then integrate the Lane-Emden equation to obtain various parameters. 
Observations suggest R/Ro = (M/Mo)°* in this range. Use this to determine 
the central density, pressure and temperature in terms of M/Mo. (b) High mass 
stars (M > 50Mo) can be approximated as a polytrope with y = 4/3 and has the 
observed relation R/Ro = 5.5(M/100Mo)°*°. Once again estimate the central 
parameters in terms of M/Mo. (c) Assume that the opacity is of the form « = 
kopl' °. Integrate the equation for radiative equilibrium and find a relation 
between temperature and pressure as a function of radius inside the star. (Ignore 
radiation pressure compared to gas pressure.) Use the results of (a) and (b) to 
determine «xo in terms of the mass and luminosity of star. 


Exercise 4.5 Degenerate to the core: 


The electron number density at the core of the Sun is about 4.4 x 102° cm7?. 


Are these electrons degenerate? What about the helium nuclei at the core of the 
Sun? 


Exercise 4.6 This won't do: 


If the Sun’s energy output is purely due to gravitational contraction, how long 
would it last? If the Sun were powered by chemical reactions (which releases, say, 
10* cal g~'), how long will it last? 


Exercise 4.7 Radioactive decay and light curves: 


The °°Ni decays to °°Co with a half-life of 6.1 days which, in turn, decays °°Fe 
with a half-life of 77.1 days. Estimate the form of the luminosity of a supernova 
as a function of time when the cobalt decay is dominant. For supernova 1987A 
take L = 10*? erg s~‘at t = 0 for the purpose of cobalt decay extrapolation. 
What is the total amount of energy released and the mass of nickel synthesized? 


Exercise 4.8 Bound on the neutrino mass: 


Neutrinos with energies (7.5 — 35) MeV were detected over a time interval of about 
12.4 seconds from supernova 1987A which occurred at a distance of about 50 kpc 
from Earth. Assuming that all these neutrinos were emitted simultaneously and 
the delay in arrival time is due to neutrinos of different energies having different 
speeds put an upper bound on the possible mass of the neutrino. [Ans. mic? < 17 


eV. | 


Exercise 4.9 Betelgeuse going boom: 


Betelgeuse is a super giant star of mass 20 Mo located at a distance of 200 pc 
from us. It is very likely to explode as a supernova within the next million years. 
Assume that this results in an iron core of mass 1.4 Mo, density of 10° gm cm~? 
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and that the core will collapse to a radius of 10 km. What is the amount of 
gravitational binding energy that is released? What is the free fall collapse time? 
If 1 per cent of the binding energy comes out as visible light in 100 days, what is 
the average luminosity of the supernova? What is the energy flux at Earth? Do 
you expect this supernova to out shine the Sun? 


Exercise 4.10 Decreasing star formation: 


Suppose that the star formation rate in our galaxy decreases as exp[—(¢/7)] where 
T = 3 Gyr and that the main sequence life time of a 3 Mo star is about 350 Myr. 
Compute the fraction of all the 3 Mo stars ever made which are still on the main 
sequence. [Ans. About 4.6 x 1073.] 


Exercise 4.11 Return gift: 


Using the relation between remnant mass and initial mass of the star described 
in the text and Salpeter IMF, estimate the fraction of the mass that is returned 
to the interstellar medium after 10 Gyr. 


Exercise 4.12 Growing old: 


Estimate what fraction of light from an old globular cluster comes from the stars 
which are still in the main sequence. 


Exercise 4.13 Infant mortality: 


Star formation occurs in giant molecular clouds which will result in a weakly 
bound cluster of stars. Consider a cluster with N = 10° stars and radius R = 1 
pe. Estimate the time scale over which such a star cluster will be disrupted by 
the shearing motion in the galaxy. How does it compare with its relaxation time? 


Exercise 4.14 Real life astronomy — Galactic center: 


Take a look at Dwek et al., Ap.J.,445, p. 716 (1995) which has an analysis of the 
infrared emission from the center of the galaxy. Take a look at figure 1 of this 
reference. (a) Using this picture, make a rough estimate of the total infrared flux 
in W m7’. (Be careful with the units!) Assuming a distance to this region to 
be 8.5 kpc, compute the luminosity of the central region in these wavelengths in 
terms of Lo. (b) Most of this light comes from red giants. (Why?) If a typical red 
giant is an MO type star, look up the stellar classification scheme to find the peak 
wavelength at which red giants emit. (c) You will find that the peak is not at 2.2 
um at which this data is taken. Hence the total luminosity needs to be corrected. 
Estimate the correction factor [Ans: It is about 14.5.] and recompute the total 
luminosity. If a single MO red giant has a total luminosity of 400 Lo, estimate 
the number of red giants in this region. (d) The number of red giant stars should 
be a small fraction of total number of stars. Can you devise a method to roughly 
estimate the total number of stars given the number of red giant stars? 
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Exercise 4.15 Real life astronomy - HR diagram: 


Get hold of Hipparcos data from the web (or from some friendly source!). (a) 
Write a program which will compute (i) the distance to each star in the data base 
from the parallax and then (ii) the absolute magnitude. It is better to stick to 
those stars which does not have too large a parallax error. (b) Plot the apparent 
magnitude against the B-V colour as well as the absolute magnitude versus the 
B-V colour. How do they compare? (c) Plot B-V colour against the V-I colour. 
Can you explain the nature of this plot? 


Exercise 4.16 Colourful astronomy: 


Verify the two curves in Fig. 4.1. It is rather trivial to show that the curve is a 
straight line for a class of sources with power law spectrum F, œ v~~ when a is 
varied. What is more surprising is that the colour-colour indices for the Planck 
spectrum seems to be a straight line. Actually it is not strictly straight. Work it 
out analytically and convince yourself why it looks so straight. Also obtain the 
slope of this curve in terms of the three wavelengths for B, U and V. 


Notes and References 


Stellar structure and evolution are discussed in detail in several text books like, for 
example, Padmanabhan (2001), Hansen and Kawaler (1994), Clayton (1983). 

1. Sections 4.3 and 4.4: The fitting formulas for the stellar evolution quoted 
in the text are from Eggleton et al (1989). A specialist text on supernova physics 
is Arnett (1996). 


Chapter 5 


Relics of Stars 


We saw in the last chapter that stellar evolution can lead to distinct types 
of remnants. The low mass stars end up as white dwarfs while the higher 
mass stars lead to supernova and neutron stars. All these types of remnants 
are of astrophysical interest on their own and we will discuss them (though 
somewhat briefly) in this chapter, beginning with the dynamics of the region 
around a supernova and its observational signatures. 


5.1 Supernova remnants 


When a supernova explosion takes place and stellar material is ejected, the 
local region of the interstellar medium, around that event, is significantly 
disturbed. The ejected material moves through the interstellar medium 
and will eventually be the source of electromagnetic radiation in different 
wave bands. The study of such objects, called supernova remnants, helps in 
the understanding of supernova explosions as well as several evolutionary 
processes in the interstellar medium (ISM). 

In a typical supernova explosion about E ~ 10°! erg of energy is released 
over a time scale which is short compared to any other relevant time scale 
in the problem. The initial velocity of the ejected matter, vej, is related to 
the energy E = (1/2)Mvz, by 

Vej & 104 km s 7) (Esf?m-"?2) = 107? pe yr HEH? m?) (5.1) 
where we use the notation, Es; = (E/10° erg) etc., m = (M/Mọo) and 
we have used the convenient conversion 10° km s~!~ 1 pc yr7!. Such 
an explosion will lead to a shock front propagating into the surrounding 
medium. 
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In the simplest picture for the supersonic expansion of a hot, gaseous 
sphere, there will be an abrupt discontinuity between the expanding gas and 
the material swept up by the shock front. A shock wave will run ahead of 
the contact discontinuity and the region between the ‘piston’ and the shock 
wavefront will be heated to a high temperature. In the case of strong shocks 
with high Mach number My, the ratio of densities on either side of shock 
wave is p2/p1 = (y+ 1)/(y—1) = 4 if y = (5/3) (see chapter 3, Sec. 3.3.3). 
The temperature ratio, (T2/T1) = 2y(y — 1)M?/(y + 1)? = (5/16)M? (for 
y = (5/3)), can be very large for strong shocks. Hence we expect the 
shocked gas in supernova remnants to be intense x-ray emitters. 

In the evolution of such supernova remnants, one can distinguish sev- 
eral different phases of evolution each of which has its own observational 
signature. In the first phase, the material moves as a blast wave with a 
constant velocity Vej given by Eq. (5.1) so that the radius of the remnant 
grows linearly with time: r ~ vejt. There is very little resistance to this 
blast wave in the ISM initially. As the mass swept up by the shock front 
increases, the blast wave will slow down and the r œ t evolution will end 
(at some radius r = ry, say) when the mass of interstellar medium swept 
up by the blast wave, Mswept * (47/3) pismr?, is comparable to the ejected 
mass Mej- This gives 


Maj \ PISM as 
~~? J —— : ‘ 
se ps (2) 2 x 10724 gm cm~3 Oa) 





The corresponding time scale is 


5/6 
t= af ~ 200 yr (2) Bes pag: (5.3) 
ej © 
The next phase of expansion is well described by the Sedov solution 
which we studied in chapter 3, Sec. 3.3.3 in which we found that r œ t?⁄5. 
This phase could also be interpreted as the one in which the energy is 
constant. The total energy of the system can be expressed as 


1/4 
Ew : (3) prev? + (constant) (pv?) (=) r (5.4) 


where the first term is the kinetic energy of bulk motion and the second 
term is the internal energy of the system. Constancy of E implies that 
r3y? = constant. Using v = ř and integrating, we obtain r œ (E/p)1/5t2/5. 


As we mentioned in chapter 3, Sec. 3.3.3 the constant of proportionality is 
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of order unity for y = 5/3. So this phase is characterized by the scaling: 


x [E a 2/5 ~ 1/5 part5 42/8 
r= - t 0.3 pe Esi le (5.5) 


The corresponding velocity and temperature are: 
-3/2 
v = * œ 5000 kms! G) BA? yp? (5.6) 
2 pe 


-3 -6/5 
T ~ 6 x 108K (=) Esi nyt = 10° K Eng” (i) ; 
2 pe 3 x 104 yr 
These scalings predict a temperature of about 3 x 108 K for a supernova 
remnant with an age of about 104 yrs. Plasma at such temperatures will 
radiate strongly in x-rays. Also note that Eq. (5.6) defines a characteristic 
time scale 


tsedov 3 10" yr Ty 5/6 El? n Pa (5.7) 


as a function of the temperature. For example, the supernova remnant 
Cygnus Loop has reached the end of this phase right now with Rpow œ 20 
pe and tpow © 115 km s~!. The age of this remnant, obtained from the 
above relation is about t ~ 6.5 x 104 yr. 

The Sedov phase, which conserves the energy, ends when the energy 
loss due to radiative cooling becomes significant. As the temperature falls 
below 10° K, the ions of C,N and O will acquire electrons and form atomic 
systems. The recombination, followed by the cascading down to the ground 
state through the emission of photons, will cool the gas. We saw in Chapter 
2 that the rate of cooling at these temperatures is given by n3,A(T) = 1072? 
erg cm? s~'n2,T, 1/2 (see Eq. (2.112)). The third phase will be the one in 
which the eoline losses are important. The transition to this phase can be 
estimated by the condition tooo S tsedov Where 


nkpT i TR”? 
teool = n2A(T) ~4x10 yr ( nH (5.8) 


and tsedov is defined in Eq. (5.7). The condition tcool 
Te $ Ex! "72/7 or equivalently, 


t œx T™? < 200 km s™* (Esing) 4. (5.9) 


tsedov leads to 


x 


This relation shows that the velocity at the end of sedov phase is about 200 
km s~! and is only weakly dependent on E and ny. 
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At this stage, the shock becomes isothermal. There will be a hot interior 
region [with radius r(t)] surrounded by a cool dense shell. The shell moves 
with approximately constant radial momentum piling up interstellar gas like 
a snow plough. (Since the shell is thin, we can describe it by a single radius 
r(t).) The cooling takes away most of the shock energy and the motion 
of the shell forward will be described approximately by the equation for 
momentum conservation: 


Z |(4n/3)pr°4] = 0. (5.10) 


If the thin shell is formed at a time to with radius r = rọ and velocity vo, 
then the first integral to the above equation is: 


(40/3) pr?# = (41/3) prévo. (5.11) 
This integrates to: 
1/4 3/4 
r= (1+42 e-t) ; =w (1442 ¢-)) . (5.12) 
TO TO 
For large t, we get r œ t!/4 and 
; : i -3/4 
LA —3/4 ~ = 
v=Trat ~ 200 km s (za 08 =) . (5.13) 


The constant factor in the above equation is fixed by equating the the sedov 
phase velocity of Eq. (5.6) to 200 km s~!. 

In the final phase, the speed of the shell drops below the sound velocity 
of the gas in the ISM, which is about (10 ~ 100) km s~!, in a time scale of 
about t œ (1 — 5) x 10° yrs. In about this time scale, the remnant loses its 
identity and it is dispersed by random motions in the interstellar medium. 
The complete evolution is shown schematically in Fig. 5.1 and is a nice 
example of different physical processes operating at different time scales in 
the same system. 

Supernova emits x-rays profusely during the first two phases from the 
hot material located behind the shock. The key emission process is thermal 
bremsstrahlung from the plasma at a temperature of 10° K or higher. The 
bright optical filaments are formed during phase 3 and their emission comes 
in the form of line radiation in the material with a temperature of 104 K. 
This emission is clearly characteristic of the radiating atoms. 

Supernova explosions and the eventual dispersion of ejected material 
have the effect of enriching the interstellar medium with the material pro- 
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Fig. 5.1 Different stages in the evolution of a supernova remnant, illustrated in terms 
of the behaviour of its radius, as a function of time. 


cessed in stellar interiors. In particular, the heavy elements synthesized 
inside a star reach the interstellar medium through this process. To make a 
simple estimate, let us assume that the supernova explosion can be approx- 
imated as an instantaneous appearance of a remnant of radius R = 100 pc 
which survives for a time of t ~ 108 yr. If the galaxy is modeled as a disk 
of radius 15 kpc and thickness of 200 pc and if the supernova rate for our 
galaxy is (1/30) per year (occurring randomly in the disc), then the volume 
averaged supernova rate is 


Renr ~ 2.3 x 107! pew3 yrat. (5.14) 


Assuming the supernova explosion is a Poisson process, we can estimate 
the fraction of the volume filled by the supernova remnants to be 


f=1-exp (- (F) R Rant) ~ 0.55. (5.15) 
In other words, more than half the volume of the galaxy would have been 


run through a supernova remnant. Any given point of the galactic disc will 
participate in this process, once in a period of about 10° yrs. Thus the entire 
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ISM is affected by the occurrence of supernova explosions. Since massive 
stars evolve at shorter time scale and are also more likely to end up as 
supernovas, the evolution of first generation of massive stars (Pop II stars) 
changes the character of the ISM. Second and later generations of stars 
(Pop I) condense out of this enriched interstellar medium and will have 
higher proportion of heavier elements. Since the older stars are created 
from the gas which is yet to be metal enriched by supernova while the 
younger stars are formed out of metal enriched gas, it is obvious that there 
should be a direct relation between the age of the star and the fraction of 
all elements other than hydrogen and helium present in the star, usually 
called metallicity. 

Also note that the total amount of energy released in supernova ex- 
plosions over the life time of the universe Hp! ~ 10h~!Gyr (where 
Ho = 100hkm s~!Mpc™! is the current expansion rate of the universe 
and h = 0.7; see chapter 6) is about 4000 h?Es51; for comparison, the bind- 
ing energy of visible part of the galaxies is about 1300 h? if the galaxies 
contain about ten times the dark matter as visible matter (see Chapter 8). 
Supernova explosions are energetic events even globally. 

In addition to these x-ray and optical emissions, supernova remnants are 
also strong sources of radio waves. This is due to the synchrotron radiation 
from the electrons spiraling in the magnetic fields. We saw in chapter 2, 
Sec. 2.4.2 that, if the differential number density of relativistic electrons 
per unit volume is taken to be 


n(E)dE = KE? dE (5.16) 


then the total synchrotron flux at frequency v, from an optically thin source 
at a distance d, can be expressed as 


V 1 e3 3e (p—1)/2 
S, = —G K BOt+p)/2 p-(p-1)/2, ey R 
d? j i ii 4n me? \ 4rm3c5 
(5.17) 


where V is the volume of the source and B is the average magnetic field 
[see Eq. (2.58); note that Sy = Vj,/(4md?)]. We will first determine the 
dependence of the flux S, on the radius r of the remnant, so that one can 
determine p from the observations. To do this, we need to first fix the 
dependence of B and K on r. In the case of a supernova remnant, the 
gas at T > 104 is strongly ionized during the Sedov phase and hence the 
magnetic field will be frozen to the plasma fluid (see chapter 3, Sec. 3.4.2). 
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The conservation of magnetic flux implies 
2 
B(r) = Bo (2) (5.18) 


showing B x r~? as the remnant expands. Next, to determine the ra- 
dial dependence of K, we will use the conservation of the number of rel- 
ativistic electrons. If the energy density of relativistic electrons decay 
due to adiabatic expansion of the volume, the energy density e varies 
as d(eV) = —PdV. Taking the pressure of relativistic electrons to be 
P ~ (1/3)e, this equation integrates to give e x r74. The total energy 
of the electrons E = eV decays as r~! allowing us to write 


E(r) = Eo (2) I (5.19) 


This, in turn, implies that the condition for the conservation of total number 
of electrons 


E2 r 3 pEaro/r 
Vo KoE ’”dE = Vo (=) J. K(r)EPdE (5.20) 
Ey ro Exro/r 

can only be satisfied if (K(r)/Ko) = (r/ro)~+”) which fixes the scaling 
of K(r). Using this result and Eq. (5.18) in Eq. (5.17), we get S (r) = 
S.(ro)(r/To)~2” = Sv(to)(t/to)~4”/> where we have used the Sedov relation 
r x t?/5. The surface brightness X, = (S,/mr?) of the remnant scales as 


—2(p+1) 
E,(r) = E (ro) (5) (5.21) 


which relates observable quantities to the index p for electron distribution. 

While the scalings are easy, the actual numerical estimate of the syn- 
chrotron emission would require an estimate of the magnetic field in the 
remnant which is difficult to determine. One way of addressing this prob- 
lem is as follows: It is possible to show that the total energy content of the 
system (emitting certain amount of synchrotron radiation) has a minimum 
as a function of the magnetic field strength. If we further assume that 
the system is in the minimum energy configuration, we can determine the 
magnetic field. 

The total energy density of a source containing electrons of number 
density ne and a magnetic field B is given by 


2 


B 
Ptot = Pe + PB = neyme? + ae (5.22) 
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We will now show that, for a given measured value of the synchrotron 
flux jv, the first term varies as B~9/?; that is, pe x pR” 1. Hence, for a 
given flux, the total energy density has a minimum as a function of the 
magnetic field. To obtain this scaling, note that the synchrotron emissivity 
is proportional to jy X neP/ve where P œ y?B? is the power emitted by a 
single electron and Ve « 7?B is the frequency at which synchrotron peaks 
(see Eq. (2.52)). This gives jp « neB which can be equivalently expressed 
as 


iCal: Mee OP R 
7 cs 3(87)1/4,/2nqme 


Hence, for a fixed jy, we have pe « pee $ leading to a distinct minimum for 
(Pe+prg). Minimizing ptot now gives the minimum condition to be the one in 
which the energy densities are essentially in equipartition, pe © pg within a 
numerical factor of order unity which is unimportant. Since energy densities 
are proportional to the pressure, this condition is equivalent to pressure 
equilibrium between the magnetic field and the relativistic electrons. 

To get a feel for the numbers involved, let us consider the supernova 
remnant Cas A. This has a flux of 2.72 x 10773 Watts m~? Hz~! and an 
angular diameter of 4’. Assuming the source is spherical and is about 2.8 
kpc away, we get the luminosity at 1 GHz to be about 2.6 x 1018 Watts Hz! 
and the volume of the source to be 5.3 x 105° m3. We will approximate the 
total emission from this system to be about 2.6 x 1027 Watts. In that case, 
a simple calculation gives 


= 6 x 107}? c.g.s. (5.23) 


Bmin % 2 x 1074 Gauss;  pminV © 104! J. (5.24) 


These numbers are quite appreciable. For comparison, the kinetic energy of 
optical filament in the source is about 2 x 1044 J and the rest mass energy 
of 1Mọ is about 2 x 1047 J. There must exist a fairly efficient mechanism 
for converting the explosive energy of supernova into particles and mag- 
netic field to lead to values indicated by Eq. (5.24). The modeling of such 
remnants is an active area of research. 

The magnetic field also plays another — completely different — role in 
supernova remnants which is worth mentioning. In studying the supernova 
shock propagation, we have used the fluid description for the matter. A 
closer look shows that fluid approximation leading to standard shock wave 
theory (developed in chapter 3, Sec. 3.3.3) may not be applicable! An 
ejection velocity 104 km s7! corresponds to individual kinetic energy of 
about 2 MeV for, say, each proton. When such protons hit the hydrogen 
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atom, the latter will be ionized; the cross section for this process is about 
Tion © a2 ~ 1071” cm? and the energy lost per ionisation is about 50 eV. If 
the medium has a density of about 1 hydrogen atom per cubic centimeter, 
the stopping length l for a 2 MeV proton is about 


2 MeV 1 3 
tm (ZEY) —— = 10 pe. (5.25) 





This is larger than any macroscopic length scale of interest in our sce- 
nario and hence one suspects that fluid approximation is inapplicable. The 
situation is, however, saved because the interstellar medium contains mag- 
netic fields which affect the motion of charged particles. A magnetic field 
strength of, say, B ~ 3 x 1076 Gauss would result in a Larmor radius of 
about Ry, ~ 2 x 10!° cm = 1078 pc. Even though the energy density 
of such a magnetic field is not of much direct dynamical consequence, Rr 
will play the role of effective mean free path leading to the formation of a 
collisionless hydromagnetic shock. 

Similar situation arises in the study of many other plasma systems, like 
e.g., the solar wind in the vicinity of earth’s magnetic field. Whenever 
the Larmor radius is small, it can act as the effective mean free path for 
the scattering of charged particles. The ratio between the mean free path 
due to standard Coulomb collisions [1 œ (T?/n)| and the Larmor radius 
[Rr x (T1/?/B)| varies as (BT?/*/n) and can be large in tenuous high 
temperature plasmas with strong magnetic fields. This ratio is unity for a 
critical magnetic field 


—3/2 
B: = 10719 Gauss (mx) (=) . (5.26) 


1 cm73 


The magnetic field in most astrophysical plasmas will be much larger than 
B, and hence the above effect will be important. In the supernova, the 
magnetic field itself is swept up by this process and is collected along with 
the gas by the moving shock front. The shock scenario here is thus more 
complicated than the normal collisional shock and one does expect part of 
the energy deposited by the shock to appear as relativistic kinetic energy of 
a fraction of the charged particles. In spite of these differences, the Larmor 
radius acts as the effective mean free path for the system and renders the 
fluid approximation at least approximately valid. 
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5.2 White dwarfs 


We saw in the last chapter that at the end of stellar evolution, when the 
nuclear fuel in the star is exhausted, the gravitational force will start con- 
tracting the matter again and the density will increase. Eventually, the 
density will be sufficiently high so that the quantum degeneracy pressure 
will dominate over thermal pressure. Such astrophysical objects, formed at 
the end of stellar evolution, are usually termed as compact since — as we 
shall see — their sizes are significantly smaller than main sequence stars 
of similar mass. Depending on the average density, these compact objects 
can have different internal structures which we shall now discuss. 

For densities in the range of 10° gm cm~? < p < 109 gm cm~3, the 
remnant is made of a non degenerate ideal gas of ions and a degenerate gas 
of electrons. To get a feel for such an object, let us work out the numbers. A 
typical example could be a carbon white dwarf with temperature Teg = 2.7 x 
104 K and mass M = 1Mo. To begin with, these numbers show that the 
white dwarfs are fairly faint objects and are difficult to detect. For example, 
a 4 m class telescope can reach a bolometric limiting magnitude of about 
24.7. The radius of the white dwarf, estimated as R = (L/4noT4,)'/?, 
is about 5000 km. Consider such a white dwarf located at a distance r 
from Earth. The flux received from it will be F = oT4(R/r)*. Using the 
fact that Mboi = 0 corresponds to Fy = 2.75 x 1075 erg cm~? s7! (see 
Eq. (4.14)), we can easily verify that the limiting bolometric magnitude is 
reached when r ~ 2.5 kpc. Thus only nearby white dwarfs can be detected 
by such telescopes. Since they are significantly dimmer than the normal 
stars, they will lie to the left and below the main sequence in a HR diagram. 


Given the mass and radius, we find that the mean density is p ~ 
(3Mo/47R°) % 4 x 10° gmcm-%. For this density, one can estimate 
the mean distance between the carbon ions as di ~ (p/mc)~1/3 œ 0.02 
A where mc œ% 12my is the mass of carbon nucleus. This is smaller 
than the radius of the carbon atom ro œ% (ao/Z) œ (ao/6) = 0.08 A. 
Hence the electrons will be stripped off the nuclei and will exist as a sepa- 
rate fermionic gas. Further, the mean distance between the electrons will 
be dee © (Zp/mc)71/3 = 107? A while the thermal wavelength of the 
electron is about Ae = (h?/mekpT)\/? ~ 10 A. Since Ae Z dee, quan- 
tum mechanical treatment is required. To decide whether the electrons 
are relativistic or not, we should compare mec with the Fermi momentum 
pr = (37?)'/3pnt!? = (3n2)/3h(p/ue)/3, where pe = (p/NeMp) is the 
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number of baryons per electron. The Fermi momentum of the electron will 
be equal to mec at the density p = pe where 
3 
Pe = T ppe =E ~ 10fue gm cm™?. (5.27) 
In our case the actual density is marginally in relativistic domain but one 
can just about get away with a non relativistic treatment. 

The equilibrium condition for such a system will require the degeneracy 
pressure of matter to be large enough to balance gravitational pressure. 
Equivalently, the Fermi energy ep(n) must be larger than the gravitational 
potential energy eg ~ Gm2N?/3n}/3, When the particles are non rela- 
tivistic, ep (n) = (h?/2m_)(3n7)?/3n?/5 and this leads to Eq. (4.68). Using 
n = (3N/4r R?) and N = (M/mp), one obtains the following mass-radius 
relation: 


RM? ~ ag! Aem}? ~ 8.7 x 1073 RoM4” (5.28) 


for the white dwarfs. A white dwarf with M ~ Mọ will have R œ 107*Ro 
and density p ~ 10% po. 

At lower densities, (p < 10° gm cm~%), ions form a crystal lattice struc- 
ture and the electron density is nonuniform over the scale of the lattice. The 
coulomb interactions become important in deciding the structure of the 
matter and the equation of state becomes stiffer; eventually, the electro- 
static forces lead to the occurrence of nonzero density at approximately 
zero pressure, as in the case of normal solids made of non degenerate 
matter. The Coulomb interaction between electrons is characterized by 
the energy Ecoul = (Ze*/dee) © (Ze?n}/ 3) where ne is the number den- 
sity of electrons. The ratio between Coulomb energy to Fermi energy is 
(Ecou/€r) ~ Z?/3.a(p,/p)'/3 where a is the fine structure constant. When 
Ecoul K €r, one can treat the gas as ideal. This approximation breaks 
down at low densities (9 < p.Z*a? ~ Zu.) and the Coulomb interac- 
tions lead to the formation of a crystal structure with finite density at zero 
pressure. The ratio of electrostatic energy of nearest neighbours to the 
thermal energy, T'e = (Z2q?/akgT), where a is about the Bohr radius, de- 
termines whether Coulomb effects are important or not. I = 1 occurs at 
p = 85(T/108 K)? gm cm~%. Large values of (= 170) lead to crystalliza- 
tion. 

Since the white dwarfs actually have a nonzero temperature, they will 
have a finite luminosity due to radiation of heat energy. This will, in turn, 
make them cool and their location in the HR diagram will change. To 
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study the evolution of the track of white dwarfs in the HR diagram, it is 
necessary to model its cooling and the resulting change of the temperature. 
The luminosity will be determined by the nature of the physical processes 
taking place near the surface of the white dwarf. But the equation of state 
for degenerate matter will be inapplicable near the surface of a white dwarf 
since (whatever may be the internal structure) we expect p — 0 near the 
surface. Hence the surface layer of the white dwarf needs to be treated 
separately along the following lines: 

Let us assume that the internal density of a white dwarf decreases from 
some central value pe at r = 0 to p = O at the surface, r = R. Then 
the equation of state will cease to be that of a degenerate gas at some 
intermediate point r = rg; that is, for r < ro, we shall assume that the 
equation of state is that of a degenerate gas while at r > ro, the equation 
of state is that of an ideal gas. Since the degeneracy is decided by the 
ratio (er/kgT) — see Eq. (3.13) — it makes sense to determine 79 by the 
following condition: At r = ro, the Fermi energy er = (p}/2me) of the 
system will be equal to the thermal energy kgT. This condition gives 


po = MpHe h a 2kpTo 3/3 (5 29) 
2 3n? \ mec Mec? j 
which provides the relation between the density pọ and temperature Tọ at 
the transition point. Numerically, in c.g.s units, we have: 











po = 6 x 1079 pe TH”. (5.30) 


Further, we will argue that the large thermal conductivity of the degenerate 
electron gas ensures that the temperature is same (T = To) for all r < ro. 
To see this, we need to estimate the thermal conductivity of the degenerate 
electron gas —- which comes under the general class of parameters known 
as transport coefficients. Since this concept is important, we will take a 
moment to set up the general formalism and then will apply it to compute 
conductivity. 

The context we want to study is a very common situation in which a 
system is not in thermal equilibrium globally, but can still be described as 
in a local thermodynamic equilibrium with the parameters like tempera- 
ture, T(x), varying gradually in space. In this case, the gas will exhibit 
gradients in temperature, density and velocity fields. When these gradi- 
ents are small, one can study the effects of these gradients — to the lowest 
order in their variation — by introducing what are conventionally called 
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transport coefficients. In this description, density gradient leads to diffu- 
sion of particles, temperature gradient leads to conductive heat flux and 
velocity gradient leads to viscosity. The relevant transport coefficients are 
coefficients of diffusion, thermal conduction and viscosity. We will work it 
out for diffusion and you can easily extend the idea to find other transport 
coefficients. 

Consider a gaseous system in local thermodynamic equilibrium in which 
the particles are moving randomly in all directions. Let AA be a small area 
in some plane perpendicular to y-axis which we take to be along the vertical 
direction. The number of particles crossing this area AA , in unit time, 
either from above or from below will be (nvAA/6). The factor 6 arises 
because, on the average, one third of particles will be traveling along the y- 
axis of which half will be traveling downwards. The particles collide, on the 
average, within one mean-free-path | = (no)~1. Hence the particles coming 
from above will transport the heat energy at the point (y + l), while the 
particles going from below will transport the heat energy at the point (y — 1) 
in the opposite direction — thereby leading to thermal conduction. Suppose 
the density of the particles varies along the y—axis. The net number AN 
of particles which crosses the area AA per second will be 


AN = (3) AAv [nly +1) — nly —D] ~ G) (AA) ul ($=) (6.31) 


The flux of particles along the y—axis, Jy = (AN/AA) is conventionally 
expressed as Jy = D(On/Oy), where D is called the coefficient of diffusion. 
Comparison with Eq. (5.31) shows that 


De (3) (ul) (5.32) 


which is a general result for coefficient of diffusion. 

Coming back to the case of white dwarfs, the mean free path is | = 
(nio)! where o ~ (2Ze?/m,.v?)? is the relevant cross-section for charged 
particle interaction. This gives 


1 Me \2 v® 
Dea. (Za) m oe 


Ile 





If p ~ 10° gm cm~ and v ~ up ~ c then we get D ~ 200 cm? s7!; the 
corresponding diffusion time scale in the white dwarf interior is toond © 
(R?/D) ~ 10° yr which is quite short compared to the age of the galaxy. 
Hence thermal conduction can quickly equalize the temperature at r < ro 
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and we may take T = To for all r < ro. It is the thin surface layer of non 
degenerate matter near the surface, ro < r < R, which prevents efficient 
cooling of white dwarfs and determines its luminosity. 

We can now find the scaling relation, relating the luminosity of the white 
dwarf to other parameters. We saw earlier (see chapter 4, Eq. (4.37)) that in 
a radiative envelope p x (M/L)1/?T?-?5, At the point of matching (r = ro) 
between the degenerate core and the envelope, Eq. (5.30) requires that we 
must have po « T;/*. Combining the two relations we get (M/L)\/2T3:25 œ 
TÈ / 2. or La M TË POM pie where the last proportionality follows from 
the fact that core is isothermal. The characteristic cooling time for such a 
system is 


U MTrore — 
tcool © I% 772 X Te. (5.34) 


core 





This leads to the scaling law L x M pe xM foo 
Combining the relations L x T and L œ T7/?, one finds that T œ 
T’/? or, equivalently, È « —L'?/7. Including all the numerical factors and 


integrating, the luminosity evolution of white dwarfs is found to be 


-1 5/7 -2/7 -5/7 -5/7 
t = 6.3x 10° yrs A M (£) Z) _ (£0 
12 Mo 2 Lo Lo 
(5.35) 
This expression relates the time ¢ to the luminosity L(t) if the initial lu- 


minosity (at t = 0) was Lo. Ignoring the Lo term at late times (when 
L < Lo), we get 


ANI Pa! LNI 
x 6 es ae a 
t = 3.9 x 10° years (5) (©) (=) ; (5.36) 


Let N(L) denotes the number density of white dwarfs in the range of lumi- 
nosities (L, L+ dL); then the rate of change of this number density is given 
by 


. dN [(i\ (aN 1 dN L\*!"/ aN 
dL L dink t(L) din L M dla L 
(5.37) 
If the white dwarfs are produced at an approximately constant rate, then 
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we expect N = constant giving 


dN LYA 


Thus the luminosity function, giving the number density of white dwarfs 
per logarithmic interval of luminosity, is a power law with slope (—5/7). 
We do expect this power law to be truncated at the extreme limits. At 
low luminosities and temperatures, crystallization will modify the structure 
while at very high luminosities and temperatures, one has to take into 
account effects due to neutrino cooling. In the intermediate range, we 
expect the above power law to hold. This simple result agrees reasonably 
well with observations. 

The coolest white dwarfs which have been observed have L = 10745Lo 
with a cooling time — given by Eq. (5.36) — of about 10!° years which is 
comparable to the age of the galaxy. A dramatic drop in the population 
of white dwarfs is observed in our galaxy around L ~ 10-4°Lo. This is 
consistent with an age of (91.8) x 10° years for our galaxy. Adding the time 
spent in the pre white dwarf stage in the stellar evolution would suggest 
that the star formation in the disk of the galaxy began about (9.3+2) x 10° 
years ago. This is about 6 billion years shorter than the age of the globular 
clusters which are distributed in a more spherically symmetric manner in 
our galaxy and are not confined to the disk. This suggests that the disk of 
the galaxy formed later than the halo — a scenario we will come back to 
in chapter 7. 


5.3 Neutron stars and pulsars 


At densities p > 10° gm cm~? the electrons can combine with the protons 
in the nuclei and thus change the composition of matter. The equation 
of state at higher densities needs to be determined by studying the the 
minimum energy configuration of matter allowing for nuclear reactions. The 
details are quite complicated but a simplified analysis — which captures the 
essential physics — can be based on the idea that, electrons can combine 
with protons through inverse beta decay to form neutrons, which can then 
provide the degeneracy pressure that will balance gravity. Equation (5.28) 
is still applicable with me replaced by mpn; correspondingly the right hand 
side of Eq. (5.28) is reduced by (An/Ac) = (Me/Mn) ~ 1073. Such objects 
— called neutron stars — will have a radius of R ~ 10~-°Ro and density 
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of p = 10!5p6 if M ~ Mo. For these values (GM/c?R) ~ 1 and general 
relativistic effects are beginning to be important. 

When such a neutron star forms as a result of a collapse, its angular 
momentum is conserved to a large degree. If the initial angular momen- 
tum of the pre-collapse core was Ji, = M Nin R2,» then the conservation 
of angular momentum, Jin = Janai, requires Nanai S Qin(Rin/Ranai)?. If 
the ratio between the initial and final radii is about 10° and if we take 
Qin = (27/20 days), this leads to a fairly rapid rotation for the neutron 
star with Qanal © (27/1074 sec). 

As the core collapses to form a neutron star, its electrical conductivity 
becomes very high. In the limit of infinite conductivity, the magnetic flux 
through the star, ® « BR?, will also be conserved (see chapter 3, Sec. 3.4.2). 
This leads to the final magnetic field of Banat = Bin(Rin/Ranai)?. If 
Bin © 100 Gauss and (Rgnai/Rin) ~ 10°, we get Banari œ% 101? Gauss 
and hence the neutron star will be threaded by a very strong magnetic 
field. Such rotating, magnetic neutron stars emit a characteristic pattern 
of pulsed radiation with remarkably stable period. These objects, called 
pulsars, are extensively studied and are used as probes of several other as- 
trophysical phenomena. For example, isolated pulsars allow the probing 
of the interstellar electron density and magnetic field by the processes of 
plasma dispersion and Faraday rotation described in Sec. 3.4.2. Pulsars in 
binary systems and clusters allow the study of a host of other phenomena 
related to evolution of binary stars. They also act as clocks located in 
strong gravitational fields thereby helping one to test specific predictions 
of general relativity. 

The energy emitted by a pulsar span a wide band of wavelengths — from 
radio to hard x-rays — with most of the energy being emitted at higher 
frequencies. In spite of this fact, pulsars are most conveniently observed 
in radio frequencies and — over years — significant amount of data has 
accumulated about radio pulsars. There are now more than 1200 pulsars 
which are believed to be a galactic population of isolated neutron stars 
whose radiation is powered by rotation. The periods of rotation range from 
milliseconds to several seconds, with most of the pulsars being confined to 
periods of 0.2 to 2 seconds. As the rotational speed of the pulsar decreases, 
the period will increase, with the typical rate of change of period being 
È = 107}5, But for this slow decay, the period of the pulsar happens to be 
remarkably stable, making them excellent clocks. One of the most exten- 
sively studied pulsars is the Crab pulsar, from which radiation is observed 
over 60 octaves from 75 MHz to 500 GeV. 
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To the lowest order of approximation, one can think of pulsars as rotat- 
ing magnetic dipole moments emitting magnetic dipole radiation. Though 
this model is too simplistic, it leads to the correct order of magnitude esti- 
mate for radiation and hence is worth discussing briefly. 

Consider a rotating neutron star with a magnetic dipole moment m, 
radius R and angular velocity Q. Let the direction of m be inclined at an 
angle a to the rotation axis. In that case, the magnetic moment m varies 
with time as 


1 
m= 5 BR (ej cosa + e1 sina cos Nt + e' sina sin Nt) (5.39) 


where eq is the unit vector along the rotation axis and e1,e’, are (fixed) 
mutually orthogonal unit vectors perpendicular to ej. Note that the mag- 
nitude of the magnetic moment at the pole, a = 0, for a magnetized sphere 
of radius R and magnetic field B must be |m| = (BR*/2), which fixes the 
overall scaling in the above equation. Any time varying magnetic dipole 
moment emits radiation at a rate 


dE 


dE P= _ BPR®0A sin? a 
dt eo 


2 |. 
T ~ 33 6c3 


(5.40) 
(This is completely analogous to the electric dipole radiation discussed in 
chapter 2, Sec. 2.1.1. Equation (5.40) is the magnetic counter part of 
Eq. (2.8).) This radiation is emitted at the frequency of rotation Q of the 
pulsar, and vanishes in the case of an aligned rotator with a = 0. Assuming 
that the energy is supplied by the kinetic energy of rotation, K = (1/2)I?, 
it follows that 

dE _ og = BRO sinta 


= INQQ= 


a eal (5.41) 


Using the expression for (dE /dt)}, we can define a characteristic time scale 
T by 
3 
= (3) eae Lae (5.42) 
Q/ 5 B* R802 sin“ a 


where the subscript ‘0’ denotes the current value. Integrating Eq. (5.41) 
and fixing the initial conditions correctly, we get 





202 4\ 71? 
= z) (5.43) 
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n= (1+ 
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where Q; is the initial angular velocity at t = 0. This equation gives the 
angular velocity of the pulsar as a decreasing function of time; clearly, the 
pulsar is spinning down due to energy loss. Setting Q = Qo gives the present 
age of the pulsar to be 


2 

These formulas can be used to make some simple estimates regarding the 
pulsar emission. The Crab pulsar, for example, has originated as a result 
of supernova explosion which occurred in the year 1054 AD and hence is 
about T = 103 years old. Its current period, P = (27/bo), is about 0.033 
seconds. Estimating |(| = (Qo/T) = 1078 s72, we can calculate the energy 
loss rate to be about (dE/dt) = 6.4 x 10°8 erg s7!. If sina ~ 1, then 
Eq. (5.42) allows us to determine the magnetic field to be B = 5.2 x 107? 
Gauss. These values appear to be reasonable. 


5.4 Black holes 


When the density is still higher, the Fermi energy has to be supplied by 
relativistic particles and ep now becomes ep ~ hicn'/3, This expression 
scales as ep œ n'/, just like the gravitational energy density, eg in Eq. (4.1). 
Therefore, the condition ep > €g can be satisfied only if hc > G m? N?/3 
or, only if N < ag” a Noa73/?. The corresponding mass bound (called 
Chandrasekhar limit) is M S Men where 


Men ~ Mp ag ~ 1 Mo. (5.45) 
(A more precise calculation gives a slightly higher value.) Thus, when the 
mass of the stellar remnant is higher than Meh ~ ag” Pmp, no physical 
process can provide support against the gravitational collapse. 

The difference in the behaviour of the equilibrium radius R(M) for a 
body of mass M in the non relativistic and extreme relativistic limits is a 
simple consequence of the limiting forms of the equation of state. The same 
result can be interpreted in terms of the pressure balance along the following 
lines. Pressure balance against gravity requires the ratio ¢ between the 
gravitational pressure (GM7/R*) and the degeneracy pressure P to be some 
constant of the order of unity. Taking P « p” « (M/R°)7, this ratio 
becomes 


C = (GM?/PR*) x M?77 RT, (5.46) 
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For a non relativistic system with y = 5/3, we have x M'/3R. In equi- 
librium, ¢ will have some definite numerical value; if M is now increased, 
equilibrium can be restored by R decreasing so as to maintain the con- 
stancy of M!/3R. Hence a system supported by non relativistic degeneracy 
pressure will contract if more mass is added. On the other hand, for the 
relativistic systems with y = 4/3 we have Ç x M?/3 ~ (M/M.)*/3, which 
is independent of the radius. If M < Men, pressure exceeds gravity and the 
system will expand reducing the density and making the system (eventu- 
ally) non relativistic; then an equilibrium solution can be found at a suitable 
value of M and R. But if M > Men, the gravitational force exceeds the 
pressure and the system will contract. But since ¢ is independent of R, the 
contraction will not lead to a new equilibrium configuration and the system 
will continue to collapse under self gravity. In such a case, the star will 
form a black hole and is likely to exert very strong gravitational influence 
on its surroundings. 

Such a body necessarily has to be described by general relativity with 
a metric in Eq. (1.79) 


-1 
d? = {1— gen edt? —(1- 26M dr? — r? (d8? + sin? bdg’) . 
cr Cr 
(5.47) 
This leads to a wide variety of curious phenomena, of which we will discuss 








a few. 

An immediate consequence of this metric is that radiation emitted by a 
body having such a strong gravitational field will be significantly redshifted 
as it propagates towards infinity. If w(r) is the frequency of radiation 
emitted by a body of radius r and wo is the frequency with which this 
radiation is observed at large distances, then our discussion after Eq. (1.75) 
shows that wo. = w(r)(1 — 2GM/c?r)}/?; that is, radiation from near r > 
2GM/c? is redshifted by a large factor. 

To explore this in greater detail, consider a wave packet of radiation 
emitted from a radial distance re at time te and observed at a large distance 
r at time t. The trajectory of the wave packet is, of course, given by ds? = 0 
in Eq. (5.47) which — when we use dô = dd = 0 — is easy to integrate. 
We get 

2GM 1—2GM/c?r 4GM We 
c(t- te) = r=ret—7 In (seman = sai = etek ae n (25) , 
(5.48) 


In the last equality we have expressed the result in terms of the frequency 
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observed at r. This result shows that, when the radiation is emitted from 
a location just outside the Schwarzschild radius (ie., re > 2GM/c”), and is 
observed at a large distance (ie., r >> 2GM/c?) the frequency of radiation 
measured by an observer at infinity is exponentially redshifted: 


w(t) ox exp —(c?t/4GM) = K exp —(xt/c) (5.49) 
where K is a constant and we have introduced the parameter 


_ c a GM 
~ 4GM ~ (2GM/c?)2 


K (5.50) 
which gives the gravitational acceleration (GM/r?) at the Schwarzschild 
radius r = 2GM/c?. This quantity, called the surface gravity, plays an 
important role in black hole physics. 

Incidentally, this result tells you that the wavefront ¢(t) has a phase 
which itself is decaying exponentially with time: 


g(t) = exp[—i [eae = exp[iNe~ 4/9), 1 = cK/n. (5.51) 


(Remember that the instantaneous frequency is just the time derivative 
of the phase of the wave when w is not a constant.) This is clearly not a 
monochromatic wave and has power at all frequencies. The power spectrum 
of this wave is given by P(v) = |f(v)|? where f(v) is the Fourier transform 
of o(t) with respect to t: 


s= Z poe, (5.52) 


Because of the exponential redshift, this power spectrum will not vanish 
for v < 0. Evaluating this Fourier transform (by changing to the variable 
Qexp[—(Kt/c)] = z and analytically continuing to Im z; see exercise 5.8) 
one gets: 


f(v) = (ce/K AQT (—ive/K)e™/*. (5.53) 
Taking the modulus, we get: 


B 1 27e 
v|f(-v)|? = Baw — 4? p= mI Ta (5.54) 


This leads to the the remarkable result that the power, per logarithmic band 
in frequency, at negative frequencies is a Planck spectrum with temperature 
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kpT = (h«/2nc); and, more importantly, 
If(—v)P/|F)/? = exp(—Ghv). (5.55) 


If the wave $(t) is interacting, for example, with a two level system, then you 
know from standard time dependent perturbation theory that the modes 
which vary as e**”* cause the upward and downward transitions between 
the two levels. The above result shows that such a two level system — that 
interacts with the wave — will be driven to a steady state with the two 
levels populated in accordance with the thermal distribution of temperature 
p. (Though f(v) in Eq. (5.53) depends on 2, the power spectrum |f(v)|? 
is independent of Q.) An observer detecting the exponentially redshifted 
radiation in Eq. (5.51) at late times (t — o0), originating from a region close 
tor = 2GM/c’, will attribute to this radiation a Planckian power spectrum 
given by Eq. (5.54). This result lies at the foundation of associating a 
temperature with a black hole. 

Mathematically, such a result will arise for any wave with an exponential 
redshift. If (t) varies as in Eq. (5.41) it has a power spectrum which is 
Planckian with temperature T = K/27. We will use this result in chapter 6 
in the context of exponential expansion of the universe which leads to the 
redshift of the form in Eq. (5.41). 

The general relativistic description of gravity diia changes the nature of 
orbits in the gravitational field of a point mass. To obtain this result in the 
simplest manner, let us begin with the trajectory of a particle in special 
relativity under the action of a central force. The angular momentum L = 
r x p is still conserved but the momentum is now given by p = ymv with 
y = (1—v?) 71/2, (We will temporarily use units with c = 1). So the relevant 
conserved component of the angular momentum is L = mr?(d6@/ds) = 
ymr2 rather than mr?6. (This, incidentally, means that Kepler’s second 
law regarding areal velocity does not hold in special relativistic motion in a 
central force.) Consider now the motion of a free special relativistic particle 
described in polar coordinates. The standard relation E? = p? + m? can 
be manipulated to give the equation 


2 
E? ($ ) = BP (5 n m?) (5.56) 


(This is still the description of a free particle moving in a straight line but in 
the polar coordinates!) Since special relativity must hold around any event, 
we can obtain the corresponding equation for general relativistic motion by 


simply replacing dr, dt by the proper quantities \/|gii|dr, /|goo|dt and 
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the energy E by the redshifted energy: E/vy/|goo| (see Eq. (1.75)) in this 
equation. This gives the equation for the orbit of a particle of mass m, 
energy E and angular momentum L around a mass M. With some simple 
manipulation, this can be written in a suggestive form as: 








2GM\ dr 1, 2 1/2 
(- ) aoe [E? — Vee(r)] (5.57) 
with an effective potential: 
2GM r 
2 
V(r) = m? (1 => sm) (1 + =) , (5.58) 


Several important aspects of the motion can be deduced from the nature 
of this effective potential which is shown in Fig. 5.2. 


Label = L/2GMm 


Vor 


0.95 


0.9 





r/2M 


Fig. 5.2 The effective potential governing the motion of a particle in the Schwarzschild 
metric for different values of the angular momentum. 


(i) For a given value of L and E the nature of the orbit will, in general, be 
governed by the turning points in r, determined by the equation V(r) = 
E?. For a given L > 2GMm the function Veg(r) has one maxima and one 
minima. Further, Veg — m as r — oo for all values of L. If the energy E 
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of the particle is lower than m then there will be two turning points. The 
particle will orbit the central body with a perihelion and aphelion. This is 
similar to elliptic orbits in Newtonian gravity but the perihelion will now 
precess. (See exercise 5.7.) 

(ii) If m < E < Vmax(Z) there will be only one turning point. The 
particle will approach the central mass from infinity, reach a radius of closest 
approach and will travel back to infinity. This is similar to the hyperbolic 
orbits in Newtonian gravity. 

(iii) If E = Vmax(L), then the orbit will be circular at some radius ro 
determined by the condition V’(ro) = 0,V(ro) = E. This gives the radius 
of the circular orbit and the energy to be 











To L? 12G?M?m? | - 
IGM ~ 10M am? | ty to E l (5.59) 
L? 2GM \? 
E? = = ; . 
mite (1 : ) (5.60) 


The upper sign refers to stable orbit and the lower to the unstable orbit. 
The stable orbit closest to the center has the parameters, ro = 3GM, L = 
2/3GMm and E = ,/(8/9)m. 

(iv) If E > Vinax(L), the particle “falls to the center”. This behaviour 
is in sharp contrast to Newtonian gravity in which a particle with nonzero 
angular momentum can never reach r = 0. 

(v) As the angular momentum is lowered, Vmax decreases and for L = 
2GM™m the maximum value of the potential is at Vmax = mM. In this case 
all particles from infinity will fall to the origin. Particles with E < m will 
still have 2 turning points and will form bound orbits. 

(vi) When L is reduced still further, the maxima and minima of V(r) 
approach each other. For L < 2V/3GMm, there are no turning points in 
the V(r) curve. Particles with lower angular momentum will fall to the 
origin irrespective of their energy. 

So far, we have thought of black holes as end stages of stellar evolution. 
More complicated processes can lead to the formation of black holes with 
masses significantly higher than stellar masses in the centers of galaxies. 
In the extra galactic domain there are objects called active galactic nuclei 
and quasars (see Chapter 9) which have a luminosity of about 104 erg s7! 
which are thought to be powered by accretion discs around very massive 
black holes. 
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5.5 Compact remnants in stellar binaries 


In the description of stellar evolution we have assumed that the star is an — 
isolated system and is not affected by the surrounding, which is certainly 
not the case if the star is a part of a binary star system. It is then possible 
for one of the binary stars to evolve into a compact remnant — white 
dwarf, neutron star or even a black hole — and accrete matter out of the 
second star leading to a wide variety of new physical phenomena. In such 
an accretion process, the gravitational potential energy is converted into 
the kinetic energy of matter and dissipated as thermal radiation. Some of 
the sources of high energy radiation — both galactic and extra galactic — 
are generally believed to be powered by such accretion processes. In the 
galactic scale, accretion discs around stars can be a source of thermalized 
x-ray emission; in the extra galactic domain there are objects called active 
galactic nuclet and quasars. 

The basic energetics in the process of accretion is the following: When 
a mass M falls from infinite distance to a radius R, in the gravitational 
field of a massive object with mass M,, it gains the kinetic energy E ~ 
(GM.M/R). If this kinetic energy is converted into radiation with efficiency 
e, then the luminosity of the accreting system will be L = ¢(dE/dt) = 
€(GM,/R)(dM/dt). In galactic contexts, the compact object will have 
a mass in the range of (0.6 — 1.4)Mo in case of white dwarfs or neutron 
stars and somewhat higher in case of black holes. For material accreting 
on to a neutron star of mass 1.4 Mo and radius R = 10° cm, the accretion 
luminosity will be 


GM.M R \'( M M 
Pace = R ie (as =) (i) (r =] i 


(5.61) 
The accretion luminosity will therefore increase in proportion to M. 
There is, however, a natural upper bound to the luminosity generated by 
any accretion process on to a compact remnant of mass M,. This limiting 
luminosity, called Eddington luminosity, (see Eq. (2.38) with K = oT /Mp 
arising from the electron scattering opacity) is 


4nGmyc M M, 
Le = —— M. ~ 1. ka ; = = 1049 (| —* . 
E Ea 1.3 x 10 (span) ess 10 Mo Lo 
(5.62) 


The first form of the equation is applicable in the case of accretion on to 
supermassive black holes and thought to be relevant in the context of active 
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galactic nuclei. The second form of the equality is relevant for galactic 
x-ray sources powered by stellar mass black holes. The temperature of 
a spherical system of radius R radiating L ~ Lp will be determined by 
(4r R?)oT* = Lg; that is, 


M 1/4 R —1/2 
T ~ 1.8 x 108 K £ Ten i 
8x10 GE) € ca) (5.63) 





For a solar mass compact (R ~ 1 km) stellar remnant, this radiation will 
peak in the x-ray band. 

Equating the accretion luminosity in Eq. (5.61) to the maximum possible 
luminosity in Eq. (5.62) we obtain the maximum possible accretion rate to 
be 


Meaa = (1.5 x 1078 Mo yr") (ae) ; (5.64) 
For a typical neutron star with Me = 1.4Mọ, R = 10° cm, the max- 
imum accretion rate that can be sustained in steady state is about 
1.5 x 1078 Mọoyr™!. It is also clear from the above relations that an accre- 
tion rate of (107? — 107-8) Moyr—? will produce luminosities in the range 
of (3 = 10*)Lo. 

These considerations lead to two different classes of accreting sources in 
the binary systems involving normal stars and a compact neutron star. In 
the first case, we have a neutron star orbiting around a high mass normal 
star. High mass stars have outer regions which are loosely bound and 
hence there is a strong stellar wind from such stars. the orbiting neutron 
star can accrete a fraction of the material in the wind which is flowing 
past it, thereby leading to a steady M. For typical stellar wind losses, this 
accretion rate turns out to be well described by the empirical relation 


5.67 
Mace © 5 X 107% Mo yr Gr) (5.65) 
Mo 

where M is the mass of the star. In order to have significant luminosity, 

we need Macc to be at least about 10-12Mo yr7!. The above equation 

shows that the mass of the star has to be more than about 20 Mg for this 

to be viable. These binary systems are called High mass X-ray binaries or 
HMXB, for short. 

An alternative possibility for a binary system is a low mass star and 

a neutron star orbiting around each other. If the radius of the low mass 
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star increases significantly during the course of evolution, the strong grav- 
itational field of the neutron star can remove the mass lying in the outer 
region of the companion star. (This is essentially same as the tidal disrup- 
tion discussed in chapter 1, Sec. 1.2 in connection with the Roche limit; 
see Eq. (1.38). In fact, this process is called Roche limit overflow.) This 
accretion can occur at two different time scales, depending on the mass of 
the star. If the mass of the star is more than that of the compact remnant 
(i.e, if M > 1.4Mo), then the transfer can be unstable and occur relatively 
rapidly at the thermal time scale of the star: 


M? (M/Mo)? 
Re! E Genes) 


Using the scaling relations L « M?,R x M, and assuming that most of 
the stellar mass (say, about 0.8 Ma) is transfered at this time scale, we get 
a mass transfer rate of 





Tth = (5.66) 


; 3 
M~ E ~ 3 x 1078 (=) Mọ yr™!. (5.67) 
This is, however, not a viable scenario because, for M > 1.4Mọ, this will 
lead to an accretion rate higher than the maximum accretion rate allowed 
by Eq. (5.64). The second possibility is for stars with masses less than that 
of the compact remnant (i.e, if M < 1.4Mọ). In this case, mass transfer 
takes place at (a much slower) nuclear time scale of the star: 


M Ey 
ms 10 
ES Ge) Ce = Oe 


If L x M? we have M ~ 107!°Mo yr7!(M/Mo)3. This is a possible 
accretion scenario for M < 1.4Mọ; for M = 1.4Mo this will lead to a 
maximum accretion rate of about 3 x 107!°Mo yr7 which will correspond 
to a luminosity of Lace © 10°Zo. Such systems are called Low Mass X-ray 
Binaries or LMXB, for short. Thus you see that viable accretion powered 
x-ray sources can exist only when the companion to the neutron star has a 
mass less than 1.4Mo or greater than about 20 Mo. 

Accretion to a white dwarf can lead to yet another class of phenom- 
ena of which the most important is a supernova explosion called Type I 
Supernova — to distinguish it from the supernova occurring at the end 
stage of an isolated star, which is called Type II. The Type I supernova can 
be modeled as an explosion occurring in a accreting carbon-oxygen white 
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dwarf which is a member of a closed binary system. When the mass of the 
white dwarf reaches about 1.3 Mo, carbon burning begins at the center 
of the star in the degenerate core and moves towards the surface. This 
motion occurs subsonically (called deflagration front rather than detonation 
front) converting about one half of white dwarf’s mass into iron before the 
degeneracy is removed. Further expansion can lead to a cooling which will 
eventually dampen the nuclear burning leaving shells of intermediate mass 
elements around a nickel iron core. The energy released in the process can 
disrupt the star resulting in the supernova explosion. 

Finally, we mention another remarkable result related to compact bi- 
naries though it is not directly connected with accretion of matter. This 
involves an application of general relativity in a compact binary system to 
verify a result of fundamental significance — that led to an award of the 
Nobel prize. Considering its importance, we shall briefly describe how this 
is achieved. Let us consider a binary star system of which one member is 
a pulsar from which we are receiving periodic signals and the other com- 
panion is a neutron star or black hole. In Newtonian gravity, such a binary 
system will be described by elliptical orbits. Observation of such a system 
can only determine the combination of masses given in Eq. (1.98) of Ex. 1.7. 
The situation, however, is different in general relativity. When general rel- 
ativistic effects are taken into account the major axis of the elliptical orbit 
will precess in a specific calculable rate. (See Ex. 5.7.) Using this, one ` 
can determine individually, the masses m1, mg, the size of the major axis 
A and sini where 7 is the inclination of the orbital plane with respect to 
the line of sight. Further, such an orbiting system will emit gravitational 
radiation and lose energy at a calculable rate, E. This loss of energy will 
lead to a change in the period of the orbit given by P/P = 3(E/2E). Since 
all the parameters of the system are known, one can now actually pre- 
dict the value of (P/P) and then compare it with the observations. This 
exercise has been carried out with remarkable success for a pulsar PSR 
1913+16. The predicted rate of change of the period for the star based on 
emission of gravitational radiation is P = —(2.40258 + 0.00004) x 10~!? 
which should be compared with the observed value P = —(2.425 + 
0.010) x 10-!*. They match quite well with a ratio being (1.002 + 0.005) 
thereby providing an indirect confirmation of the existence of gravitational 
radiation. 
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Exercises 


Exercise 5.1 Supernova in an ISM bubble: 


It is possible that the strong stellar winds preceding the supernova explosion of a 
star could have led to a density profile in the local ISM which has an approximate 
form pism x rÊ (so that pism Æ constant) around the star. Repeat the analysis 
of supernova blast wave propagation in such a medium and obtain the scaling 
relations. In particular, show that the Sedov phase now has R «x t?/©-2), This 
indicates that the shock wave accelerates for 8 > 3 and decelerates for 8 < 3. 
Give a physical reason for this behaviour. 


Exercise 5.2 Cutting it off: 


Consider a supernova remnant with an age of about 10° yr and a magnetic field 
of 1074 Gauss. Electrons with an arbitrary energy distribution (not necessarily 
a power law) are contributing to the synchrotron radiation from this remnant. 
Explain why the electron spectrum should have a sharp cut-off in energy at an 
energy scale which depends on time. 


Exercise 5.3 Dissecting Crab nebula: 


Given the parameters in the text for the Crab nebula: (i) Estimate (a) the 
energy density of synchrotron photons in the Crab nebula; (b) the ratio of inverse 
Compton cooling rate to synchrotron cooling rate for a relativistic electron in the 
Crab nebula. (ii) If the CMB photons undergo inverse Compton scattering with 
the relativistic electrons in Crab nebula, estimate the spectrum of the resulting 
gamma rays. Observations of gamma ray spectrum from the Crab suggest that 
J ~ 3 x 1077(E/1 TeV)~?® photons m~? s~ TeV~!. If this arises due to the 
above process, what magnetic field strength does this result imply? How is the 
result modified when we take into account the inverse Compton scattering of the 
synchrotron photons? 


Exercise 5.4 Going too fast: 


If a neutron star spins too fast, it will start losing material from the equatorial 
region due to centrifugal force. Show that this implies a minimum period Pmin « 
M~*/? R3/? where R and M are the radius and mass of the neutron star. Estimate 
Pmin supplying all the numerical factors for M = 1.4Mo, R= 10 km. 


Exercise 5.5 Neutron star collisions: 


Consider the collision of two neutron stars of equal mass obeying the mass radius 
relationship RM1/3 = constant. Compute the binding energy which is released 
in the process, assuming the same form of the mass radius relationship holds for 
the final object. What do you expect the final object to be? 
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Exercise 5.6 Separating explosively: 


Supernova explosions often occur in binary systems resulting in substantial 
amount of mass being lost from the binary. Suppose that the initial mass of 
the two stars were Mı and M2 and that the star with larger mass lost a mass 
AM due to supernova explosion. Assume that this mass AM leaves the system 
instantaneously and exerts no further influence. Show that if the amount of mass 
lost is greater than half the total mass of the system, then the resulting system 
will become unbound. [Hint: The hard way to do this problem is to work out 
what happens to the center of mass after the explosion using momentum conser- 
vation and then compute the total energy of the final system to figure out when 
it becomes positive. Some miraculous algebraic cancellations occur leading to 
the above result. The cleverer method is to note that the energy of the binary 
can be written as E = (1/2)MtotV? + u(v?/2) — (GMiotu/r) where Miot is the 
total mass, yz is the reduced mass and V is the velocity of the center of mass. 
For the initial circular orbit, the second term is —(1/2)(GMtotu/r). After the 
explosion V? is instantaneously unchanged while u and Mio. change. Take it 
from there.] 


Exercise 5.7 Precession of perihelion: 


Since the equation of motion for the particle in Schwarzschild metric is not that 
in an inverse square potential, we do not expect the Runge-Lenz vector to be 
conserved. Hence, an elliptical orbit of the Newtonian gravity will precess due 
to the lowest order general relativistic correction. Obtain the rate of precession - 
along the following lines: (a) Use the arguments given in the text to conclude 
that r?ĝ/ J00 is a constant. Hence obtain the equation to the orbit valid in the 
lowest order of approximation: 


du GMm? 3GM 2 
q2 t! 7 -a T a u (5.69) 





where u = (1/r) and L is a constant. Retaining only the first term on the right 
hand side gives the Newtonian elliptical orbit while the second term leads to 
the precession. Solving this equation approximately, show that the precession 
of the major axis for a low eccentricity, nearly circular, orbit is: 6¢ ~% 2r[1 — 
(6GM/c?ro)|~*/? per orbit where ro is the radius of the original orbit. 


Exercise 5.8 Black hole temperature: 


Obtain the results in Eq. (5.53) and Eq. (5.54) along the following lines. The 
first step is to change the integration variable to z = Qexp[—(xt/c)]. From 
the analytic properties of the integrand, show that it is legitimate to rotate the 
contour of integration along the Im—z axis. The resulting integral is essentially 
a gamma function and will lead to Eq. (5.53). To get Eq. (5.54) from this result, 
you need |I (iz)|? which is equal to (i/z)f'(iz)I'(1 — iz). Take it from there. 
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Exercise 5.9 Pseudo Newtonian potential: 


Some aspects of the motion in Schwarzschild metric can be mimicked by using 
an effective Newtonian potential des = —GM/(r — Rs) where Rs = 2GM/c?. 
Compare the motion of a non relativistic particle under the action of this potential 
with the exact result in Schwarzschild metric. 


Exercise 5.10 Spinning up the companion: 


When the disk accretion takes place around a central compact object, the ac- 
cretion disk can exert a torque on the compact object and spin it up. The rate 
at which the angular momentum gets transfered from the Alfven radius rm (see 
Eq. (3.48)) is given by rhr2,w(rm) = Iw. This will change the rotational period 
of the compact object at the rate P/P = —(WP/2m). Show that this leads to a 
scaling law P/P œ L8/TP where L is the accretion luminosity. The spin up will 
stop when the rotational velocity of the compact object is equal to the angular 
velocity of the disk at the Alfven radius. Show that the final period obeys the 
scaling Pana « BT m3. 


Exercise 5.11 Losing mass is attractive? 


Consider two stars of mass mı and m2 in which mı transfers mass to m2 through 
filling the Roche lobe. This process conserves mass and angular momentum but 
not energy since the accretion dissipates energy into heat. Assume that the 
stars are on circular orbit of radius R which is also changing with time due 
to accretion. From conservation of angular momentum and mass, show that 
R/R = 2(m2—m1)(th2/mimz). Hence show that if the lighter star is losing mass 
the stars gradually separate away while if the heavier star is losing mass the stars 
come closer catastrophically. 


Exercise 5.12 What color is white dwarf? 


Suppose we plot the position of white dwarfs in a diagram giving the V-magnitude 
against the V-I colour. Estimate the slope of this line. How does it compare with 
the slope of the main sequence stars? 


Exercise 5.13 Energy budget: 


Compare the total energy in the magnetic field and particles in Crab today with 
the original energy released in the supernova explosion and energy lost by the 
pulsar during its age. 


Exercise 5.14 Supernova and intra cluster medium: 


The material between the galaxies in a galaxy cluster, usually called intra cluster 
medium (ICM for short), contains fair amount of metals and gas but, of course, 
no stars. The metallicity is about Zicm % 0.3Zo where Zo = 0.02. (a) Assume 
that a typical supernova arises from a 20Mo star which leaves behind a 1.5Mo 
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remnant and ejects the rest of its material of which about 2 Mo is in the form of 
metals. Compute the metallicity Zsn of the gas ejected from a single supernova. 
(b) The total intracluster mass can be written as Micm = Mej + Mp where Me; 
is the gas from the supernova ejecta with metallicity Zsn and Mp is the zero 
metallicity primordial gas which fell in the potential well as the cluster formed. 
Using the values given so far, estimate the fraction (M.;/Micm) of the gas that 
came from supernova. (c) The energy released by the supernova Esn = 10°! erg 
will also go to heat up the gas. Compute the resulting temperature of the gas and 
compare it with the intra cluster temperature estimated from x-ray luminosity 
which is about (5-10) keV. What do you conclude? (d) If Micm © 6 x 103 Mọ, 
find the number of supernova needed to generate the metallicity of 0.3Z6. 


Notes and References 


A text focussed on stellar remnants is Shapiro and Teukolsky (1983). Figure 5.1 
is adapted from chapter 4 of Padmanabhan (2001). 

1. Section 5.2: A detailed discussion of transport coefficients can be found 
in Landau and Lifshitz (1975c). White dwarfs are discussed in many books on 
stellar evolution; see, for example, Hansen and Kawaler (1994) and Kawaler et. 
al. (1997). 

2. Section 5.8: For more detailed discussion see, Meszaros (1992); Michel 
(1991), Lyne Graham-Smith (1998) and Blandford et al. (1993). 

3. Section 5.4: Black hole physics is covered in all good texts in general 
relativity, like, for example, Misner Thorne and Wheeler (1973). 

4. Section 5.5: The evolution of binary stars is a major branch of study all 
by itself. Some text books which deal with it are Padmanabhan (2001), de Loore 
and Doom (1992), Frank King and Raine (1992). A nice discussion of PSR 
1913+16 is available in the article by J.H. Taylor in Bondi and Weston-Smith 
(1998). 
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Chapter 6 


Cosmology and the Early Universe 


Having studied the physics of stars and stellar remnants, one might be 
tempted to proceed directly to the next level in astrophysical structures, 
viz., galaxies. But the formation (and evolution) of galaxies is so closely 
tied up with cosmological considerations that it is necessary to grapple with 
aspects of cosmology to understand galaxies in their totality. Keeping this 
in mind, we will devote this chapter to the physics of the evolving universe 
and take up the study of galaxies in the coming chapters. 


6.1 Evolution of the universe 


Observations suggest that the universe at large scales is homogeneous 
(‘same at all points’) and isotropic (‘same in all directions’). One way 
to quantify the idea of homogeneity is the following: Suppose you place 
a sphere of radius R at different locations in the universe and compute 
the mass (or, what is more relevant, the energy) contained within it. In a 
strictly uniform universe with density J, you will always get the mass to be 
M(R) = (47/3)pR?. In a real universe, you will get different values when 
the sphere is centered at different points so that there will be a fractional 
rms fluctuation a(R) = ((M — M)*)/?/M = (5M/M)p around the mean 
mass. If this quantity decreases with R and becomes sufficiently small for 
large R, then it makes sense to talk about a universe which is smooth at 
large scales with a mean density p. In that case, if we average the inho- 
mogeneities over a sufficiently large scale R, the fractional fluctuations will 
be small, making the average density a useful quantity. In our universe, 
o(R) % 0.1 at R % 50h-! Mpc (where 1 Mpc % 3 x 104 cm is a conve- 
nient unit for cosmological distances) and o(R) decreases roughly as R~? 
at larger scales. 
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You can, of course, do the same kind of observations with the density 
{rather than mass) and you will find that the fractional fluctuations (ôp/ p) rR 
in the mass density (and energy density) p, within a randomly placed sphere 
of radius R, also decreases with R as a power law. We will use the fractional 
contrast in the fluctuation of either mass or density interchangeably. 

This suggests that we can model the universe as made up of a smooth 
background with an average density p(t) — which is, at best, only a function 
of time and we have omitted the overbar on f to simplify the notation — 
superposed with fluctuations in the density dp(t, x), which are large at small 
scales but decreases with scale. At sufficiently large scales, the universe may 
be treated as being homogeneous and isotropic with a uniform density. 

Any large scale motion of matter in such a universe must also be con- 
sistent with homogeneity and isotropy. That is, all observers, located at 
different spatial points, must agree on the nature of the large scale motion. 
Of course, this condition is trivially satisfied if there is no large scale mo- 
tion at all and all observers see the universe as static. But it is possible 
to come up with a more general form of the motion which has this fea- 
ture. Suppose the velocity of an object located at a point r (with respect 
to some origin) is given by r(t) = v(t) = f(t)r. An observer at the origin 
will see all particles moving with a speed proportional to the distance from 
him/her. Consider now another observer at a different location rı. In the 
Newtonian picture, (s)he will see the object at r to move with the velocity 
v’ = v— vı = f(t)(r—r11) so that (s)he too will conclude that the particles 
are moving away with a speed proportional to the distance from him/her! 
So, at least in the Newtonian context of velocity addition, one can have a 
motion of the form i(t) = v(t) = f(é)r that is consistent with homogeneity. 

Integrating this relation, r = f(t)r, we can describe the position r of any 
material body in the universe in the form r = a(t)x where a(t) is another 
arbitrary function related to f(t) by f(t) = (@/a) and x is a constant for 
any given material body in the universe. It is conventional to call x and 
r the comoving and proper coordinates of the body and a(t) the expansion 
factor. The dynamics of the universe is entirely determined by the function 
a(t). The simplest choice will be a(t) = constant, in which case there will 
be no motion in the universe and all matter will be distributed uniformly 
in a static configuration. It is, however, clear that such a configuration will 
be violently unstable when the mutual gravitational forces of the bodies 
are taken into account. Any such instability will eventually lead to random 
motion of particles in localized regions thereby destroying the initial homo- 
geneity. Observations, however, indicate that this is not true and that the 
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relation v = (@/a)r does hold in the observed universe with å > 0. 

In that case, the dynamics of a(t) can be qualitatively understood along 
the following lines. Consider a particle of unit mass at the location r, with 
respect to some coordinate system. Equating the sum of its kinetic energy 
v?/2 and gravitational potential energy (-GM(r)/r) due to the attraction 
of matter inside a sphere of radius r, to a constant, one finds that a(t) 
should satisfy the condition 

1.2  4rGp(t) 


T le a? = constant (6.1) 


where p is the mean density of the universe; that is, 


„2 
ota e i) (6.2) 
where k is a constant. 

While the arguments given above to determine this equation (as well 
as the relation v = fr)) are fallacious, the results and — in particular, 
Eq. (6.2) — happens to be exact and arises from the application of general 
theory of relativity to a homogeneous and isotropic distribution of matter 
with p interpreted as the energy density. We shall now briefly describe how 
these results arise in general relativity. 

We saw in chapter 1 that general relativity describes gravity as a mani- 
festation of spacetime curvature. In the case of a homogeneous and isotropic 
spacetime, the spacetime interval has a particularly simple form: 

2 2 2 dr? 2/492 4 ain? 2 

ds“ = dt“ — a“ (t) SS + r°(dd* + sin“ 6d¢ | (6.3) 
where the parameter k modifies the metric of 3-space and the function a(t) 
scales the distances between any two points in the 3-dimensional space. 
One of the Einstein’s equations reduces to Eq. (6.2) while the rest of the 
equations are satisfied if the equations of motion for matter are satisfied. 
When k = 0, life becomes particularly simple and the line interval reduces 
to 


ds? = dt? — a? (t) [dx? + dy? + dz?]. (6.4) 


This is the form we will use quite often. 
Having settled that Eq. (6.2) describes the dynamics of the universe, let 
us look at it more closely. Observations suggest that our universe today (at 
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t = to, say) is governed by Eq. (6.2) with 
(2) = Ho = 0.3 x 107!7h s7! = 100 h kms! Mpc”! (6.5) 
0 


where h = (0.6 — 0.8). We will always refer to quantities evaluated at the 
present epoch (t = to) with a subscript zero. The relation v = f(t)r = 
(a/a)r = Hor is known as Hubble’s law and Ho is called Hubble’s constant. 
From Ho we can form the time scale tuniy = Hs! ~~ 1010h} yr and the 
length scale cHg 1 = 3000h—! Mpc; tuniy characterizes the evolutionary 
time scale of the universe and cH>' is of the order of the largest length 
scales currently accessible in cosmological observations. More importantly, 
one can also construct from Ho a quantity with the dimensions of density 
called critical density: 
3Hô 2 —29 ~3 1142 -3 
pe = gpg = 1.88h x 107% gm cm™’? = 2.8 x 10- hk “Mo Mpc 
= 1.1 x 10th? eV cm? = 1.1 x 1075h? protons cm~*. (6.6) 


The last two “equalities” should be interpreted in terms of conversion of 
mass into energy by a factor c? and the conversion of mass into number 
of baryons by a factor my 1 where m, is the proton mass. It is useful to 
measure all other mass and energy densities in the universe in terms of 
the critical density. If p; is the mass or energy density associated with a 
particular species of particles (say, photons, baryons etc....), then we define 
a density parameter 2; through the ratio Q; = (p:/pc). More generally, 
H(t) = (@/a), p;(t) and hence Q;(t) can be a function of time. 

It is obvious from Eq. (6.2) that the numerical value of k can be ab- 
sorbed into the definition of a(t) by rescaling it so that we can treat k as 
having one of the three values (0,—1,+1). The choice among these three 
values for k is decided by Eq. (6.2) depending on the value of the total 
Q = }_ Q; we see that k = 1,0 or —1 depending on whether Q greater 
than, equal or less than unity. The fact that k is proportional to the total 
energy of the dynamical system described by Eq. (6.2) shows that a(t) will 
have a maximum value followed by a contracting phase to the universe if 
k=1,0>1. 

The light emitted at an earlier epoch by an object will reach us today 
with the wavelength stretched due to the expansion. If the light was emitted 
at a = Qe and received today (when a = ag), the wavelength will change by 
the factor (1+ Ze) = (ao/ae) where ze is called the redshift corresponding 
to the epoch of emission, ae. Correspondingly, the frequencies will decrease 
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with expansion as v œ (1+ z)~!. Thus we associate a redshift z(t) to any 
epoch a(t) by (1 + z)~! = a(t)/ao. It is very common in cosmology to use 
z as a time coordinate to charatecterise an epoch in the past. Since the 
observed luminosity L of a source is proportional to (|p,|c)d°p, œ v3dv œ 
(1+ 2z)~*, where p, = (e/c) = (hv/c) is the photon momentum, L will 
decrease as (1 + z)~* in an expanding universe. 

To determine the nature of the cosmological model we need to determine 
the value of Q for the universe, taking into account all forms of energy 
densities which exist at present. Further, to determine the form of a(t) we 
need to determine how the energy density of any given species varies with 
time. Let us turn to this task. 

Suppose a particular species contributes an energy density p and pres- 
sure p. Then we need to integrate the equation of motion d(pa*?) = —pd(a*) 
(which is essentially dE = —pdV with V œ a?) to determine how p varies 
with a. The simplest type of equation of state which is adequate for de- 
scribing the large scale dynamics of the universe is p = wp with w = 
constant. Then the equation d(pa?) = —wpd(a?) can be immediately in- 
tegrated to give p x a~3(@+). Doing this for each component of energy 
density, Eq.(6.2) can be now written in the form 


42 3(1+wi) fk 
aC A 


where each of these species is identified by density parameter 2; and the 
equation of state characterized by w;. 

The most familiar forms of energy densities are those due to pressureless 
matter or radiation. For pressureless matter — usually called ‘dust’ — we 
can take w; = 0 (this represents non relativistic matter with the rest mass 
energy density pc? dominating over the kinetic energy density (pv2/2)). On 
the other hand, for radiation or extreme relativistic particles, we have (see 
Sec. 3.1) w; = (1/3). Observational situation regarding the composition 
of our universe, however, suggests a more complex situation and can be 
summarised as follows: 


e Our universe has Qot œ% 1 or, more precisely, 0.98 S Qiot S 
1.08. The value of Qiot can be determined from the angular 
anisotropy spectrum of the Cosmic Microwave Background Radia- 
tion (CMBR), with the reasonable assumption that h > 0.5, and 
these observations (which we will discuss in Sec. 6.6) now show 
that we live in a universe with density close to the critical density. 


216 


An Invitation to Astrophysics 


e Observations of primordial deuterium produced in big bang nu- 


cleosynthesis (which took place when the universe was about 1 
minute in age; see Sec. 6.2) as well as the CMBR observations 
show that the total amount of baryons in the universe contributes 
about Ng = (0.024 + 0.0012)h~?. Given the independent observa- 
tions on the Hubble constant which fix h = 0.72+0.07, we conclude 
that Qg = 0.04 — 0.06. These observations take into account all 
baryons which exist in the universe today irrespective of whether 
they are luminous or not. When combined with the previous item 
we conclude that most of the universe is non-baryonic. 
Host of observations related to large scale structure and dynamics 
(such as rotation curves of galaxies, estimate of cluster masses, 
gravitational lensing, galaxy surveys) all suggest that the universe 
is populated by a non-luminous component of matter (dark matter; 
DM hereafter) made of weakly interacting massive particles which 
does cluster at galactic scales. This component contributes about 
Qpm = 0.20 — 0.35. 
Combining the last observation with the first we conclude that 
there must be (at least) one more component to the energy density 
of the universe contributing about 70% of critical density. Early 
analysis of several observations indicated that this component is un- 
clustered and has negative pressure. This is confirmed dramatically 
by more recent supernova observations and analysis of CMBR data. 
The observations suggest that the missing component (called dark 
energy) has w = p/p < —0.78 and contributes Npr = 0.60 — 0.75. 
The simplest choice for dark energy is a “fluid’ with p = —p 
such that w = —1. In this case, the equation d(pa*) = —pda?® is 
identically satisfied with p = —p = constant! That is, a fluid with 
w = —1 will have the same energy density and pressure at all times 
as the universe expands. This is to be contrasted with the energy 
density of normal matter which decreases as the universe expands 
— because of the work done by the pdV term with p > 0. When 
the pressure is negative, the decrease in the energy density due to 
expansion can be compensated by the negative work done thereby 
maintaining constant energy density. It is indeed possible to mimic 
such a fluid by adding a term to Einstein’s equations called the 
cosmological constant. It is also possible that the energy of the 
vacuum state of the universe is non zero and exerts a gravitational 
influence. You will expect the vacuum to have an energy density 
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and pressure which is constant in space and time; the equation 
d(pa?) = —pda? then demands that it should have an equation of 
state p = —p. (Of course, one way of satisfying this is to have 
p=p=0 but it also admits p = —p= constant with any constant 
value.) Hence one often calls the dark energy arising from a fluid 
with equation of state p = —p either as cosmological constant or 
vacuum energy. We shall use these expressions interchangeably. 

e The universe also contains radiation (with p = p/3) contribut- 
ing an energy density Nph? = 2.56 x 1075 today most of which 
is due to photons in the CMBR. Assuming that most of the en- 
ergy density is at temperature T = 2.73 K today, we get pr = 
(x? /15) (k£T4/c?h3). Dividing this by pe ~ 1.88 x 10-9h?2g cm~’ 
we find that Ngh? = 2.56 x 1075. 


Taken together we conclude that our universe has (approximately) 
Npe x 0.7,9pm ~ 0.26,98 ~ 0.04,Ne ~ 5 x 1075. All known obser- 
vations are consistent with such an — admittedly weird — composition for 
the universe. 

As far as dynamics is concerned, we therefore need to deal with three 
different forms of energy densities in the universe: pyr,pr and py. If 
neither particles nor photons are created or destroyed during the expansion, 
then the number density n of particles or photons will decrease as n « a~? 
as a increases. In the case of photons, the wavelength will also get stretched 
during expansion with » « a; since the energy density of material particles 
is nme? while that of photons of frequency v will be nhv = (nhc/A), it 
follows that the energy densities of radiation and non relativistic matter 
vary as pp x a74 and pyr « a~%. Since the vacuum energy density should 
be Lorentz invariant, it has to be a constant independent of space or time 
giving py = constant. Of course, you get the same result by integrating 
d(pa*) = —pda? = —wpda to give p x a~3+”) and using w = (0, 1/3, —1) 
for non relativistic matter, radiation and dark energy respectively. 

Note that as we go to smaller a(t) in the past, radiation energy density 
grows faster (as Qra~*) compared to matter energy density (which grows 
as Qnra~*). So even though radiation is dynamically irrelevant today, it 
would have been the dominant component in the universe at sufficiently 
small a: when a < aeq = do(QNR/QR). That is, at redshifts larger that 
Zeg ~% OnR/QR S 4X 104QyRh?. Further, combining pr x a~t with the 
result pp « T4 for thermal radiation, it follows that any thermal spectrum 


of photons in the universe will have its temperature varying as T œ a7}. 
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In the past, when the universe was smaller, it would also have been: (i) 
denser (ii) hotter and — at sufficiently early epochs — (iii) dominated by 
radiation energy density since (pr/pnr) « (1/a). 

Given all these, the total energy density in the universe at any epoch 
can be expressed as 


Ptotai (a) = pr (a) + pwr (a) + pv (a) 
= pe [Or ($) + (+ 90m) (L) +] 69 


where pe and various Q’s refer to their values at a = ag; Ng refers to the 
density parameter of baryons. Substituting this into Eq. (6.7) we get 


a k ao\ 4 ag\ 3 

2+4 =H for (=) +2nr (2) + ov (6.9) 
a a a a 

with Onr = NB + Npm. This equation can be cast in a more suggestive 
form. We can write (k/a?) as (Qtot — 1)Hê(ao/a)? and move it to the 
right hand side. Introducing a dimensionless time coordinate 7 = Hot and 


writing a = aoq(T) our equation becomes 


1 /dq 4 
rakes +V(q)=E (6.10) 
where 
— 1 OR ONR 2|. = 1 
vo = -4 [B+ pave], B= 50%). (6.11) 


Equation (6.10) has the structure of the first integral for motion of a particle 
with energy E in a potential V (q). For models with Qtot = QNR+OQV+QR S 
OQnr+Qy = 1, we can take E = 0 so that (dg/dr) = ,/—2V(q). Figure 6.1 
shows the velocity (dq/dr) as a function of the position q = (1+ z)7! for 
such models. Several features are clear from this figure: (i) At high redshift 
(small q) the universe is radiation dominated (with V (q) ~ —(1/2)Qrq7?) 
and q is independent of the other cosmological parameters; hence all the 
curves asymptotically approach each other at the left end of the figure. 
(ii) At lower redshifts [O(1) < z < 10*] the universe is matter dominated 
and V(q) ~ —(1/2)Qnrq7!. (iii) At still lower redshifts [0 < z < O(1)] 
the presence of cosmological constant makes a difference and — in fact — 
the velocity q changes from being a decreasing function to an increasing 
function. In other words, the presence of a cosmological constant leads to 
an accelerating universe at low redshifts. 
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Fig. 6.1 The dynamical evolution of the universe depicted as a phase space trajectory. 
The vertical axis gives the rate of expansion of the universe (in rescaled units) and the 
horizontal axis gives the expansion factor of the universe (in rescaled units). The different 
curves are for different cosmological models with Qp = 2.56 x 1075h-?,h = 0.5, QNR + 
Ny = 1. Curves are parameterized by the value of Qnr = 0.01, 0.1, 0.3, 0.5, 1.0 going 
from bottom to top as indicated. 


Based on the above considerations, we can identify three distinct phases 
in the evolution of the universe depending on which form of the energy 
density dominates the expansion. At very early epochs, the radiation will 
dominate over other forms of energy density and Eq. (6.10) can be easily 
integrated to give a(t) « t!/?. As the universe expands, a time will come 
when (at t = teq with a = aeq and z = Zeq, say) the matter energy density 
will be comparable to radiation energy density. From the known value of 
Np, it follows that (1+ zeq) = (Qnr/QR) ~ 3.9 x 104(QnRA”). Similarly, at 
very late epochs, the vacuum energy density (cosmological constant) will 
` dominate over non relativistic matter and the universe will become “vacuum 
dominated”. This occurs at a redshift of z, where (1+ zv) = (Q,/Onr)!/9. 
For Q, ~ 0.7, NR = 0.3, this occurs at z, + 0.33. During Zy < Z X Zeq, 
the universe is matter dominated with a œ t?/3. 

The discussion so far did not really use the fact that the geometry of 
the universe is curved. This fact becomes relevant when one tries to find 
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expressions for observable quantities in the universe at moderate redshifts. 
Since the line interval of the universe is different from that of flat spacetime, 
several observable (geometrical) quantities are affected by the spacetime 
curvature. We shall now briefly consider a few of them concentrating on 
the k = 0 model. 

Cosmological observations are mostly based on electromagnetic radia- 
tion which is received from far away sources. Let an observer, located at 
r = 0, receive at time t = to radiation from a source located at r = rem. 
This radiation must have been emitted at some earlier time te such that 
the events (te, fem) and (to,0) are connected by a null geodesic. Taking 
the propagation of the ray to be along 8 = constant, @ = constant, we can 
write the equation for the null geodesic to be dt? = a(t)dr?. Integrating 
this, we can find the relation between rem and te: 


t dt 
Tem = (6.12) 


te a(t)’ 


The right hand side is a definite function of te in a given cosmological 
model. Since the redshift z is also a unique function of te, we can express 
the right hand side as a function of z and hence rem can be expressed as a 
function of z. This function rem(z) is of considerable use in observational 
astronomy, since it relates the radial distance to an object (from which the 
light is received) to the redshift at which the light is emitted. We see from 
Eq. (6.2) that Einstein’s equations allow us to determine (å/a) more easily 
than a(t) itself. It is, therefore, convenient to define a quantity (called 
Hubble radius) by 


di(t) = du(2) = op (6.13) 


where the first equality reminds you that dy can also be thought of as a 
function of the redshift z. This quantity allows us to convert integrals over 
dt to integrals over dz by using 


dt = (=) (=) deste (=) (6.14) 


It is now possible to write Eq. (6.12) as 





Tem = T dy(z)dz (6.15) 
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if we normalize a(t) by a(to) = 1. As an example, consider the matter dom- 
inated case in which (@/a) = Ho /Onra~?/?. Then dy = Hp Oh? (1 + 
z)~3/2 and 





Tem(2) = 2HP RR? h = eed (6.16) 

Two important observable quantities in which re,(z) plays a role are 

in: (i) relating the luminosity of distant objects with the observed flux and 

(ii) in the measurement of angular sizes of distant objects which we shall 

now discuss. Let F be the flux received from a source of luminosity £ when 

the photons from the source reach us with a redshift z. The flux can be 
expressed as 


ay Ch eee 1 dEem 
~ (Area) ( dtrec ) ~ (Area) (1 + z)? ( dtem ) eno 
since dEyec = (1 + z)~!dEem and dtrec = (1+ z)dtem due to redshift. The 
proper area at a = ag = 1 of a sphere of comoving radius rem(z), is 4rr2,,, 


giving 
1 1 dEem 1 1 
a a 6.18 
T dare, (1+ 2)? :( dtem ) Anr2 (1+ Ae ar) 


where L = (dEem/dtem) is the luminosity of the source. Since the distance 
to an object of luminosity £ and flux F can be expressed as (L/4rF)!/? 
in flat Euclidean geometry, it is convenient to define distance dz (z) (called 
luminosity distance) by 


1/2 z 
di{z)= (5) = Tem(2)(1 +z) = (1 + a | dy (z)dz. (6.19) 


From Eq. (6.15) and Eq. (6.19) we have the inverse relation 


dr (2z) 
dno) = $ [ZO]. (6.20) 
Thus if dz(z) can be determined from observations, one can determine 
dy(z) and thus (å/a). This is the key to supernova observations which 
determined that the universe is accelerating in its expansion in the recent 
past. 

Another observable parameter for distant sources is the angular diame- 
ter. If D is the physical size of the object which subtends an angle 6 to the 
observer, then, for small 6, we have D = rema(t.)d. The “angular diameter 
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distance” d,(z) for the source is defined via the relation 6 = (D/da); so 
we have: 


PA eer E = [ EE (6.21) 


Comparing Eq. (6.19) and Eq. (6.21) we see that dg = (1 + z)*d,4. For 
example, in the matter dominated case, 





2H 1-1/2 1 
da = Z0 “NR | x | ~ 2H~'9-1/2,-1 6.22 
A (1+z) VI+z PT SR (052) 


with second equality being valid at large z. We will need this result in 
Sec. 6.6. Also note that d4(z) is not monotonic in z. It increases as d4 œ z 
for small z, reaches a maximum and decreases as z~! at large z. 

Finally we note that the proper volume in the universe spanned by the 
region between comoving radii rem and Tem + drem can also be expressed 
in terms of dy(z) and rem(z). This infinitesimal proper volume element 
dV = „Pg dx is given by 


dV = a? r2n drem sin 6 dô dọ. (6.23) 


If we interpret this volume as the volume of the universe spanned along 
the backward light cone by the photons which are receive today with red- 
shifts between z and z + dz, then we can express drem in terms of dt using 
Eq. (6.12) and express dt in terms of dz using Eq. (6.14) and obtain 


2 
2 2 3 Tem(2)dx (2) 5 
= = aNd ; ; 
dV = adt (rém sin 0d0d¢) (1+2)3 z sin 0dôdo (6.24) 


This is used extensively in observational cosmology. 


6.2 Primordial nucleosynthesis 


Let us now try to figure out the key events in the life of such a universe. 
When the temperature of the universe was higher than the temperature 
corresponding to the atomic ionisation energy, the matter content in the 
universe will be a high temperature plasma. For t < teq, in the radiation 
dominated phase, we can also ignore the contribution of non-relativistic 
particles to p. We have seen earlier that, during the radiation dominated 


Chapter 6 : Cosmology and the Early Universe 223 


phase, a(t) œ t!/?; therefore 


»\2 2 
(ż) = H? (t) = i = ze, = see (5) T4, (6.25) 
The last equality follows from the expression for energy density for radiation 
(see Eq. (2.20)) in units with h = c = kg = 1. The factor g in these 
expressions counts the degrees of freedom of those particles which are still 
relativistic at this temperature T. As the temperature decreases, more 
and more particles will become non-relativistic and g will decrease; thus 
g = g(T) is a slowly decreasing function of T. Numerically, 


—2 
a -1 (MPN _ T ai 
t S 0.3g? (=) > 1s (m) g 2. (6.26) 


In the standard models for particle interactions, g ~ 10? at T > 1GeV; 
g œ 10 for T ~ (100 — 1)MeV and g ~ 3 for T < 0.1MeV. These relations 
give the temperature of the universe as a function of time. 

When the temperature of the universe is higher than the binding energy 
of the nuclei (~ MeV), none of the heavy elements (helium and the metals) 
could have existed in the universe. The binding energies of the first four 
light nuclei, 7H, 3H, 3He and 4He are 2.22 MeV, 6.92 MeV, 7.72 MeV and 
28.3 MeV respectively. This would suggest that these nuclei could be formed 
when the temperature of the universe is in the range of (1 — 30)MeV. The 
actual synthesis takes place only at a much lower temperature, Tnuc = Tn ~ 
0.1MeV. The main reason for this delay is the ‘high entropy’ of our universe, 
i.e., the high value for the photon-to-baryon ratio, 7~!. Numerically, 


NB —10 sgh? 
= Å— 6: oer reat . 2 
a 5.5 x 10 (3 (6.27) 





We have already seen a similar effect in chapter 3 Sec. 3.4.1 in the case of 
ionisation equilibrium, in which this effect was described by Eq. (3.75). Let 
us assume, for a moment, that the nuclear (and other) reactions are fast 
enough to maintain thermal equilibrium between various species of particles 
and nuclei. In thermal equilibrium, the number density of a nuclear species 
ANz with atomic mass A and charge Z will be 


NA=9A (a exp |- (Tact) : (6.28) 


By repeating exactly the same kind of analysis we did for ionisation equi- 
librium for this case, one can obtain the equation for the temperature TA 
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at which the mass fraction of a particular species-A will be of order unity 
(Xa = 1) 


Ba/(A—1) 


cS In(n~!) + 1.5 In(mB/T) 


(6.29) 


where Ba is the binding energy of the species. (This is analogous to 
Eq. (3.75).) This temperature will be fairly lower than B4 because of 
the large value of n~!. For 2H, ?He and tHe the value of T4 is 0.07MeV, 
0.11MeV and 0.28MeV respectively. Comparison with the binding energy 
of these nuclei shows that these values are lower than the corresponding 
binding energies B4 by a factor of about 10, at least. 

Thus, even when the thermal equilibrium is maintained, significant syn- 
thesis of nuclei can occur only at T < 0.3MeV and not at higher temper- 
atures. If such is the case, then we would expect significant production 
(Xa S 1) of nuclear species-A at temperatures T < Ta. It turns out, 
however, that the rate of nuclear reactions is not high enough to maintain 
thermal equilibrium between various species. We have to determine the 
temperatures up to which thermal equilibrium can be maintained and redo 
the calculations to find non-equilibrium mass fractions. 

The general procedure for studying non equilibrium abundances in an 
expanding universe is based on rate equations. Since we will require this 
formalism again in Section 6.3 (for the study of recombination), let us spend 
a little time and develop it in a somewhat general context. 

Consider a reaction in the early universe in which two particles 1 and 2 
interact to form two other particles 3 and 4. For example, n + ve = p +e 
constitutes one such reaction which converts neutrons into protons in the 
forward direction and protons into neutrons in the reverse direction; another 
example you will come across in the next section is p+e = H +y where the 
forward reaction describes recombination of electron and proton forming a 
neutral hydrogen atom (with the emission of a photon), while the reverse 
reaction is the photoionisation of a hydrogen atom. In general, we are 
interested in how the number density nı of particle species 1, say, changes 
due to a reaction of the form 1 + 2 = 3 + 4. 

We first note that even if there is no reaction, the number density will 
change as nı x a~? due to the expansion of the universe; so what we are 
really after is the change in na. Further, the forward reaction will be 
proportional to the product of the number densities nnz while the reverse 
reaction will be proportional to n3n4. Hence we can write an equation for 
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the rate of change of particle species nı as 


T d(n,a°) 


=: di = (Angana = nno). (6.30) 


The left hand side is the relevant rate of change over and above that due 
to the expansion of the universe; on the right hand side, the two propor- 
tionality constants have been written as u and (Ap), both of which, of 
course, will be functions of time. The quantity has the dimensions of 
cm%s~1, so that nu has the dimensions of s~!; usually u ~ ov where o 
is the cross-section for the relevant process and v is the relative velocity. 
The left hand side has to vanish when the system is in thermal equilibrium 
with n; = n;*, where the superscript ‘eq’ denotes the equilibrium densities 
for the different species labeled by i = 1 — 4. This condition allows us to 


rewrite A as A = nj2n54/(n3 nj"). Hence the rate equation becomes 





1 d(n,a*) eq_eq ( 73M4 nnz 
aa dt = HNI 2 nenga = nein r (6.31) 


In the left hand side, one can write (d/dt) = Ha(d/da) which shows 
that the relevant time scale governing the process is H71. Clearly, when 
H/np > 1 the right hand side becomes ineffective because of the (u/H) 
factor and the number of particles of species 1 does not change. You see 
that when the expansion rate of the universe is large compared to the 
reaction rate, the given reaction is ineffective in changing the number of 
particles — which is what you will expect. This certainly does not mean 
that the reactions have reached thermal equilibrium and n; = 7;%; in fact, it 
means exactly the opposite: The reactions are not fast enough to drive the 
number densities towards equilibrium densities and the number densities 
“freeze out” at non equilibrium values. Of course, the right hand side will 
also vanish when n; = n$? which is the other extreme limit of thermal 
equilibrium. 

Having taken care of the general formalism, let us now apply it to the 
process of nucleosynthesis which requires protons and neutrons combin- 
ing together to form bound nuclei of heavier elements like deuterium, he- 
lium etc.. The abundance of these elements are going to be determined 
by the relative abundance of neutrons and protons in the universe. There- 
fore, we need to first worry about the maintenance of thermal equilibrium 
between protons and the neutrons in the early universe. As long as the 
inter-conversion between n and p through the weak interaction processes 
(v+ne pte), (€+n > p+7) and the ‘decay’ (n e p+e+D), is rapid 
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(compared to the expansion rate of the universe), thermal equilibrium will 
be maintained. Then the equilibrium (n/p) ratio will be 


(=) = Ža = exp(-Q/T), (6.32) 


Np Xp 


where Q = mn — Mp = 1.293 MeV. Therefore, at high (T > Q) tem- 
peratures, there will be equal number of neutrons and protons but as the 
temperature drops below about 1.3 MeV, the neutron fraction will start 
dropping exponentially provided thermal equilibrium is still maintained. To 
check whether thermal equilibrium is indeed maintained, we need to com- 
pare the expansion rate with the reaction rate. The expansion rate is given 
by H = (8rGp/3)!/2 where p = (r?/30)gT4 with g = 10.75 representing 
the effective relativistic degrees of freedom present at these temperatures. 
At T = Q, this gives H ~ 1.1 s71. The reaction rate needs to be computed 
from weak interaction theory. The neutron to proton conversion rate, for 


example, is well approximated by 
O ala 
(2) +6 F +12]. (6.33) 


5 
Anp © 0.29 s7! (5) 
At T = Q, this gives \ = 5 s7!, slightly more rapid than the expansion rate. 
But as T drops below Q, this decreases rapidly and the reaction ceases to 
be fast enough to maintain thermal equilibrium. Hence we need to work 
out the neutron abundance by using Eq. (6.31). 
Using nı = Nn, N3 = Np and n2,n4 = nı where the subscript l stands 
for the leptons, Eq. (6.31) becomes 


1 d(nna? Npnea 
Lani pnts ( rad - tin) (6.34) 
p 











a á dt 


We now use Eq. (6.32), write (n;") = Anp which is the rate for neutron to 
proton conversion and introduce the fractional abundance Xn = Nn/ (nn + 
Np). Simple manipulation then leads to the equation 


dX, 
dt 





= np (a —~ X,)e@/T — Xn) (6.35) 


Converting from the variable t to the variable s = (Q/T) and using (d/dt) = 
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—HT(d/dT), the equations we need to solve reduce to 





dX pn 
—Hs TS Anp ((1— Xn)e™? — Xn); H = (1.1 sec™!) s74; 
0.29 s~} 
Anp = = [s? + 6s +12]. (6.36) 


It is now straightforward to integrate these equations numerically and de- 
termine how the neutron abundance changes with time. The results (of a 
more exact treatment) are shown in Fig. 6.2. The neutron fraction falls 
out of equilibrium when temperatures drop below 1 MeV and it freezes to 
about 0.15 at temperatures below 0.5 MeV. 
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Fig. 6.2 Primordial abundance of elements as a function of time. (Data for the figure: 
Courtesy Jim Kneller.) 


As the temperature decreases further, the neutron decay with a half life 
of Tn œ 886.7 sec (which is not included in the above analysis) becomes 
important and starts depleting the neutron number density. The only way 
neutrons can survive is through the synthesis of light elements. As the 
temperature falls further to T = Tye ~ 0.28MeV, significant amount of He 
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could have been produced if the nuclear reaction rates were high enough. 
The possible reactions which produces tHe are [D(D,n) ?He(D,p) *He, 
D(D,p) 7H (D,n) 4He, D(D,7) tHe]. These are all based on D, *He and 
3H and do not occur rapidly enough because the mass fraction of D, *He 
and 3H are still quite small [107!7,10-!9 and 5 x 10719 respectively] at 
T œ 0.3MeV. The reactions n + p = d + will lead to an equilibrium 
abundance ratio of deuterium given by 





pin 4 (mpmn\*”? (QnkgT)*” e-B/kBT 
nan 3 


3 \ ma (2rh)? n 


= exp [25.82 — nQgh? T? — (F) . (6.37) 
10 


The equilibrium deuterium abundance passes through unity (for Qgh? = 
0.02) at the temperature of about 0.07 MeV which is when the nucleosyn- 
thesis can really begin. 

So we need to determine the neutron fraction at T = 0.07 MeV given 
that it was about 0.15 at 0.5 MeV. During this epoch, the time-temperature 
relationship is given by t = 130 sec (T/0.1 MeV)~*. The neutron decay 
factor is exp(—t/T») 0.74 for T = 0.07 MeV. This decreases the neutron 
fraction to 0.15 x 0.74 = 0.11 at the time of nucleosynthesis. When the 
temperature becomes T < 0.07MeV, the abundance of D and 3H builds up 
and these elements further react to form *He. A good fraction of D and 
3H is converted to *He (See Fig.6.2 which shows the growth of deuterium 
and its subsequent fall when helium is built up). The resultant abundance 
of 4He can be easily calculated by assuming that almost all neutrons end 
up in tHe. Since each He nucleus has two neutrons, (n,/2) helium nuclei 
can be formed (per unit volume) if the number density of neutrons is np. 
Thus the mass fraction of 4He will be 

4(nn/2) _ _2(n/p) 

Y= aot era (n/p) = 22%, (6.38) 
where xe = n/(n+p) is the neutron abundance at the time of production of 
deuterium. For Qgh? = 0.02, ze © 0.11 giving Y ~ 0.22. Increasing baryon 
density to Qgh? = 1 will make Y ~ 0.25. An accurate fitting formula for 
the dependence of helium abundance on various parameters is given by 


Y = 0.226 + 0.025 log mo + 0.0075(g — 10.75) + 0.014(71 /2(n) — 10.3 min) 
(6.39) 
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where 719 measures the baryon-photon ratio today via the relation 





3 
Ngk? = 3.65 x 107? ( ) mo (6.40) 


0 
2.73 K 


and g. is the effective number of relativistic degrees of freedom contributing 
to the energy density and 71,2(n) is the neutron half life. 

As the reactions converting D and 3H to He proceed, the number 
density of D and 3H is depleted and the reaction rates - which are pro- 
portional to T œ X,4(nn ) < ov > - become small. These reactions soon 
freeze-out leaving a residual fraction of D and 3H (a fraction of about 1075 
to 1074). Since T œ 7 it is clear that the fraction of (D,° H) left unreacted 
will decrease with 7. In contrast, the “He synthesis - which is not limited 
by any reaction rate - is fairly independent of 7 and depends only on the 
(n/p) ratio at T ~ 0.1MeV. The best fits, with typical errors, to deuterium 
abundance calculated from the theory, for the range ņ = (1071° — 107°) is 
given by 


D 50.06 n Ti 
Y= | = =3. 1079255 > 5 ; 
2 (5). 3.6 x 10 (sn) (6.41) 


The production of still heavier elements - even those like 16C, 160 which 
have higher binding energies than tHe - is suppressed in the early universe. 
Two factors are responsible for this suppression: (1) For nuclear reactions to 
proceed, the participating nuclei must overcome their Coulomb repulsion. 
The probability to tunnel through the Coulomb barrier is governed by the 
factor F = exp[—2A!/3 (Z1 Z2)?/3(T/1MeV)~!/3] where A~! = AT'+ Aj". 
For heavier nuclei (with larger Z), this factor suppresses the reaction rate. 
(2) Reaction between helium and proton would have led to an element with 
atomic mass 5 while the reaction of two helium nuclei would have led to an 
element with atomic mass 8. However, there are no stable elements in the 
periodic table with the atomic mass of 5 or 8! The ®Be, for example, has a 
half life of only 10716 seconds. One can combine *He with ®Be to produce 
12C but this can occur at significant rate only if it is a resonance reaction. 
That is, there should exist an excited state !2C nuclei which has an energy 
close to the interaction energy of tHe + Be. Stars, incidentally, use this 
route to synthesize heavier elements. It is this triple-alpha reaction which 
allows the synthesis of heavier elements in stars but it is not fast enough 
in the early universe. (You must thank your stars that there is no such 
resonance in !®O or in Ne — which is equally important for the survival 
of carbon and oxygen.) 
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The current observations indicate, with reasonable certainty that: (i) 
(D/H) > 1x 10-5. (ii) [(D + He)/H] ~ (1 — 8) x 1075 and (iii) 0.236 < 
(4He/H) < 0.254. These observations are consistent with the predictions 
if 10.3min < 7 < 10.7 min, and 7 = (3 — 10) x 101°. Using 7 = 2.68 x 
1078 gh?, this leads to the important conclusion: 0.011 < Qgh? < 0.037. 
When combined with the broad bounds on h, 0.6 < k S 0.8, say, we can 
constrain the baryonic density of the universe to be: 0.01 S$ Qg < 0.06. 
These are the typical bounds on Qg available today. It shows that, if 
Qtotal ~ 1 then most of the matter in the universe must be non baryonic. 

Since the *He production depends on g, the observed value of 4He 
restricts the total energy density present at the time of nucleosynthesis. 
In particular, it constrains the number (N,) of light neutrinos (that is, 
neutrinos with m, < 1MeV which would have been relativistic at T ~ 
1MeV). The observed abundance is best explained by N, = 3, is barely 
consistent with N, = 4 and rules out N, > 4. The laboratory bound on 
the total number of particles including neutrinos, which couples to the Z° 
boson is determined by measuring the decay width of the particle Z°; each 
particle with mass less than (m,/2) ~ 46 GeV contributes about 180 MeV 
to this decay width. This bound is N, = 2.79 + 0.63 which is consistent 
with the cosmological observations. 


6.3 Decoupling of matter and radiation 


In the early hot phase, the radiation will be in thermal equilibrium with 
matter; as the universe cools below kgT ~ (€a/10) where €a is the binding 
energy of atoms, the electrons and ions will combine to form neutral atoms 
and radiation will decouple from matter (see Eq. (3.75)). This occurs at 
Taec ~ 3 x 103 K. As the universe expands further, these photons will con- 
tinue to exist without any further interaction. It will retain thermal spec- 
trum since the redshift of the frequency v x a7! is equivalent to changing 
the temperature in the spectrum by the scaling T « (1/a). It turns out that 
the major component of the extragalactic background light (EBL) which 
exists today is in the microwave band and can be fitted very accurately by 
a thermal spectrum at a temperature of about 2.73 K. It seems reasonable 
to interpret this radiation as a relic arising from the early, hot, phase of the 
evolving universe. This relic radiation, called cosmic microwave background 
radiation, turns out to be a gold mine of cosmological information and is 
extensively investigated in recent times. We shall now discuss some details 
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related to the formation of neutral atoms and the decoupling of photons. 

The relevant reaction is, of course, e + p = H + y and if the rate of 
this reaction is faster than the expansion rate, then one can calculate the 
neutral fraction using Saha’s equation developed in chapter 3, Sec. 3.4.1. 
Introducing the fractional ionisation, X;, for each of the particle species 
and using the facts np = ne and np + ny = np, it follows that Xp = Xe 
and Xy = (ny/ng) = 1 — Xe. Saha’s equation Eq. (3.73) now gives 


1-X 
x E & 3.84n(T/me)?/? exp(B/T) (6.42) 





where 7 = 2.68 x 10~8(Qgh?) is the baryon-to-photon ratio. We may define 
Tatom as the temperature at which 90 percent of the electrons, say, have 
combined with protons: i.e. when Xe = 0.1. This leads to the condition: 

(Qgh?)-17-2 exp [-13.67-1] = 3.13 x 10738 (6.43) 
where 7 = (T/leV). For a given value of (Qgh?), this equation can be 
easily solved by iteration. Taking logarithms and iterating once we find 
T1 & 3.084 — 0.0735 In(Qgh?) with the corresponding redshift (1 + z) = 
(T/To) given by 


(1 +z) = 1367[1 — 0.024 In(Qgh?)|7!. (6.44) 


For Ngh? = 1,0.1,0.01 we get Tatom & 0.324eV, 0.307eV, 0.292eV respec- 
tively. These values correspond to the redshifts of 1367,1296 and 1232. 


Because the preceding analysis was based on equilibrium densities, it 
is important to check that the rate of the reactions p+e e H +7 is 
fast enough to maintain equilibrium. For Qgh? = 0.02, the equilibrium 
condition is only marginally satisfied, making this analysis suspect. More 
importantly, the direct recombination to the ground state of the hydrogen 
atom — which was used in deriving the Saha’s equation — is not very ef- 
fective in producing neutral hydrogen in the early universe. The problem 
is that each such recombination releases a photon of energy 13.6 eV which 
will end up ionizing another neutral hydrogen atom which has been formed 
earlier. As a result, the direct recombination to the ground state does not 
change the neutral hydrogen fraction at the lowest order. Recombination 
through the excited states of hydrogen is more effective since such a re- 
combination ends up emitting more than one photon each of which has an 
energy less than 13.6 eV. Given these facts, it is necessary to once again use 
the rate equation developed in the previous section to track the evolution 
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of ionisation fraction. A simple procedure for doing this which captures the 
essential physics is as follows: 

We again begin with Eq. (6.31) and repeating the analysis done in the 
last section, now with nj = ne, n2 = Np, N3 = Ny and n4 = ny, and defining 
Xe = Ne/(ne + NH) = Np/NH one can easily derive the rate equation for 
this case: 


= — [ea — Xe) — ana Xè] =a (Ea — Xe) — x2) : (6.45) 





This equation is analogous to Eq. (6.35); the first term gives the photoion- 
isation rate which produces the free electrons and the second term is the 
recombination rate which converts free electrons into hydrogen atom and 
we have used the fact ne = nyXe ete.. Since we know that direct recom- 
bination to the ground state is not effective, the recombination rate a is 
the rate for capture of electron by a proton forming an excited state of 
hydrogen. To a good approximation, this rate is given by 


1/2 
a = 9.78rec (=) In (3) - (6.46) 


where ro = e? /mec? is the classical electron radius. To integrate Eq. (6.45) 
we also need to know (/a. This is easy because in thermal equilibrium the 
right hand side of Eq. (6.45) should vanish and Saha’s equation tells us the 
value of Xe in thermal equilibrium. On using Eq. (6.42), this gives 





3/2 
f- (35) aol (BITI, (6.47) 
We can now integrate Eq. (6.45) using the variable B/T just as we used the 
variable Q/T in solving Eq. (6.35). The result shows that the actual recom- 
bination proceeds more slowly compared to that predicted by the Saha’s 
equation. The actual fractional ionisation is higher than the value pre- 
dicted by Saha’s equation at temperatures below about 1300. For example, 
at z = 1300, these values differ by a factor 3; at z ~ 900, they differ by a 
factor of 200. The value of Tatom, however, does not change significantly. A 
more rigorous analysis shows that, in the redshift range of 800 < z < 1200, 
the fractional ionisation varies rapidly and is given (approximately) by the 
formula, 


x _3 (Qnrh?)!/2 z \12.75 
Xe = 24x 10° T (aa) (6.48) 
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This is obtained by fitting a curve to the numerical solution. 

The formation of neutral atoms makes the photons decouple from the 
matter. The redshift for decoupling can be determined as the epoch at 
which the optical depth for photons is unity. Using Eq. (6.48), we can 
compute the optical depth for photons to be 


t z dt z \ 14.25 
T= | n(t)Xe(t)ordt = [ n(z)X-(z)or ($) dz ~ 0.37 (a) 

(6.49) 
where we have used the relation Hodt S -QR 2,-5/2qz which is valid 
for z > 1 (see Eq. (6.14)). This optical depth is unity at zgec = 1072. 
(Alternatively, one can compare the scattering rate of photons neor = 
Xenpor with the expansion rate H. This calculation shows that neor/H 
is of order unity when Xe ~ 107? if Qgh? = 0.02 and Qyr = 0.3 leading 
to substantially the same conclusion.) From the optical depth, we can also 
compute the probability that the photon was last scattered in the interval 
(z,z+ dz). This is given by (exp—7) (dr/dz) which can be expressed as 


ae z \13.25 z \14.25 
P(z) = TZ =5.26 x 10 (<<) exp |-0.7 (=) |; 
(6.50) 


This P(z) has a sharp maximum at z ~ 1067 and a width of about Az & 80. 
It is therefore reasonable to assume that decoupling occurred at z ~ 1070 
in an interval of about Az ~ 80. We shall see later that the finite thickness 
of the surface of last scattering has important observational consequences. 


6.4 Formation of dark matter halos 


Having discussed the life history of the homogeneous universe, we shall now 
turn to the issue of formation of different structures like galaxies, clusters 
etc in the universe. 

A totally uniform universe, of course, will never lead to any of the inho- 
mogeneous structures seen today. However, if the universe had even slight- 
est inhomogeneity, then gravitational instability can amplify the density 
perturbations. To see how this comes about in the simplest context, let us 
consider a universe with a single component of matter with the equation of 
state p = wp. The w = 0 case is the most important but we shall have occa- 
sion to use others as well; so we will keep a general w. Integrating d(pa?) = 
—pda*, we find that the background density evolves as p x a~°('+”), Fur- 
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ther, differentiating Eq. (6.2), we get @ = —(47G/3)(1 + 3w)pa which has 
the solution a = (t/to)?/(#+3”) describing the background universe. We now 
perturb a(t) slightly to a(t)+da(t) so that the corresponding fractional den- 
sity perturbation is 6 = (p/p) « (6a/a). Using the form of the background 
solution, it is easy to show that the perturbation da satisfies the equation 


_ 2(1+ 3w)(2 + 3w) 


åa 
2 =. ewe 


2 
f d 


= ma; (6.51) 
This equation has power law solutions da œ t? with p(p — 1) = m. Solving 
the quadratic equation and choosing the growing mode, we find that the 


density contrast grows as 


ĝa _ 2(14+3w) 


This result can be expressed more concisely. It can be directly verified that 
this expression scales exactly as (oa?)~! where p is the background density. 
Thus we get the result that 


ô x (pa)~}. (6.53) 


In the matter dominated universe p œ a~* and the density perturbation 
grows as 6 x a. 

When the perturbations have grown sufficiently, their self gravity will 
start dominating and the matter can collapse to form a gravitationally 
bound system. The dark matter will form virialised, gravitationally bound, 
structures with different masses and radii. The physics of such dark matter 
halos is somewhat easier to understand since they are not affected by gas 
dynamical processes. In contrast, the baryonic matter will cool by radiating 
energy, sink to the centers of the dark matter halos and will form galaxies. 
The formation of galaxies, which is more complex, will be discussed in the 
next chapter. But to discuss properly the features of a scenario for the 
formation of dark matter halos, we need a model that will describe the 
nonlinear epochs of the collapse. We will develop that model first. 

Let us consider a spherically symmetric region of radius, R(t), which has 
a density p that is higher than the background density pp. In a spherically 
symmetric context, the matter outside this region does not influence the 
evolution of this over dense region. The radius R(t) will grow at a rate 
slightly slower than the background expansion (because of the over-density 
inside), reach a maximum value and contract back again. This model is 
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particularly simple for the Q = 1, matter dominated universe, in which we 
have to solve the equation 


ËR GM 

aR 
The density contrast of non relativistic matter in this region will be given 
by: 


(6.54) 


„2 __3M__1 _2GM /a(t)) 
= ST ae RG se) Hg rol l 


Solving Eq. (6.54), the evolution of a spherical over-dense region can now 
be summarized by the following relations: 


(6.55) 


R(t) = A(1 — cos 8), (6.56) 


t= B(@—sin@); A? =GMB?. (6.57) 


Substituting into Eq. (6.55) and using Ht? = 4/9 we find that the density 
and density contrast evolve as: 


— gi 2 
ptt) = pelt) se eg (6.58) 
_ 9(6—sin 6)? 


Given the solution in Eq. (6.56), Eq. (6.57) we can equate the energy of 
the motion to the initial kinetic energy of expansion, K; = (1/2)H?r? and 
thus fix A and B. We get, for 6; < 1, the result A ~ r;/26;, B = 3t;,/462/?. 
For small t, this leads to ô = (3/5)6;(t/t;)?/? which — in turn — fixes the 
overall normalisation. We can now express 6 and @ in terms of the redshift 
by using the relation (t/t;)?/* = (1+ z)(14+z)71. This gives 


ee) ar oi = (3) oi = gas (6-60) 


where ĝo = (3/5)6;(1 + zi) is the linearly extrapolated density contrast 
today, starting from 6 = 6; at z = z;. Given an initial density contrast ô; at 
redshift z;, these equations define (implicitly) the function 6(z) for z > zi. 
Equation Eq. (6.60) defines 6 in terms of z (implicitly); Eq. (6.59) gives the 
density contrast at that 6(z). 
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For comparison, note that linear evolution (with 6, œ a œ t?/3) obtained 
earlier (from Eq. (6.53) for p x a~*) leads to a density contrast 6; where 


D 36;(1 + zi 3 (3 2/3 e 
5, = bb =e saata) =; (3) (0 — sing)?/?, (6.61) 


We can estimate the accuracy of the linear theory by comparing ô(z) and 
ôL(z). To begin with, for z >> 1, we have 6 < 1 and we get ô(z) ~ 
ôL(z). When 6 = (1/2), 6, = (3/5)(8/4} 3 (r/2 — 1)?/3 = 0.341 while 
6 = (9/2)(7/2 — 1)? — 1 = 0.466; thus the actual density contrast is about 
40 percent higher. When 8 = (27/3), ðr = 0.568 and 6 = 1.01 ~ 1. If we 
interpret 6 = 1 as the transition point to nonlinearity, then such a transition 
occurs at 0 = (27/3), ôL ~ 0.57. From Eq. (6.60), we see that this occurs 
at the redshift (1 + znı) = 1.066;(1 + z;) = (60 /0.57). 

The spherical region reaches the maximum radius of expansion at 0 = 7. 
From our equations, we find that the redshift zm, the proper radius of the 
shell rm and the average density contrast ôm at ‘turn-around’ are: 

bi(1 + zi 
5 do ~ 90 


= 3 (8/42/38 7 1,062’ 


3a /P fo Qn? Py 
The first equation gives the redshift at turn-around for a region, 
parametrized by the (hypothetical) linear density contrast ĝo extrapolated 
to the present epoch. If, for example, 6; ~ 107? at z; ~ 104, such a pertur- 
bation would have turned around at (1+ zm) = 5.7 or when zm ~ 4.7. The 
second equation gives the maximum radius reached by the perturbation. 
The third equation shows that the region under consideration is nearly six 
times denser than the background universe, at turn-around. This corre- 
sponds to a density contrast of ôm œ% 4.6 which is definitely in the nonlinear 
regime. The linear evolution gives 6, = 1.063 at 0 = 7. 

After the spherical overdense region turns around it will continue to 
contract. Equation (6.58) suggests that at @ = 27 all the mass will collapse 
to a point. However, long before this happens, the approximation that 
matter is distributed in spherical shells and that random velocities of the 
particles are small, (implicit in the assumption of homogeneous over dense 
region) will break down. The collisionless (dark matter) component will 
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relax to a configuration with radius fyir, velocity dispersion v and density 
Pcol. After virialization of the collapsed shell, the potential energy U and 
the kinetic energy K will be related by |U| = 2K so that the total energy 
E =U+K=-K. At t = tm all the energy was in the form of potential 
energy. For a spherically symmetric system with constant density, E ~ 
~3GM?/5rm. The ‘virial velocity’ v and the ‘virial radius’ ryi, for the 
collapsing mass can thus be estimated by the equations: 











Mv? 3GM? 3GM? 
K= =-€ = : = = = 2 : 
5 5, l= = 2K = Mv (6.63) 
We get: 
v= (6GM/5r,)'/?; vir = Tm/2. (6.64) 


The time taken for the fluctuation to reach virial equilibrium, teoll, is es- 
sentially the time corresponding to 0 = 27. From equation Eq. (6.60), we 
find that the redshift at collapse, Zeon, is 





_ 6;(1 + zi) ta , yy = ôo 
(+ zon) = aaga 7 36C + 24) = 0.63(1 + 2m) = Teas: 
(6.65) 


The density of the collapsed object can also be determined fairly easily. 
Since ryir = (Tm/2), the mean density of the collapsed object is Pcon = 8Pm 
where pm is the density of the object at turn-around. We have, pm = 
5.6po(tm) and Polim) = (1+ zm)? (1+ zcon)~3p0(tcou). Combining these 
relations, we get 


Pcoll S Bom ~ 44.804(tm) fax 170ps(tcou) ~ 170po(1 + Zeol)? (6.66) 


where po is the present cosmological density. This result determines pecol 
in terms of the redshift of formation of a bound object. Once the system 
has virialised, its density and size does not change. Since pẹ « a`’, the 
density contrast ô increases as a? for t > teo. The evolution is described 
schematically in Fig. 6.3. 

Such virialised, gravitationally bound, structures — once formed — 
will remain frozen at a mean density f which is about fe ~ 200 times 
the background density of the universe at the redshift of formation, z. 
Taking the background density of the universe at redshift z to be pyg(z) = 
pcQnR(1+ z}, the mean density J of an object which would have collapsed 
at redshift z is given by p ~ Onrpcf-(1+z)?. We define the circular velocity 
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Fig. 6.3 Modelling the nonlinear growth of the density contrast using the dynamics of a 
spherical overdense region. The two curves indicate the density evolution of a spherical 
overdense region and the background universe. 


Ue for such a collapsed body to be, 


= — = —— pr’. (6.67) 


Eliminating fp in terms of ve, the redshift of formation of an object can be 
expressed in the form 


200 M (ve /200 km 57+)?” 


Onr fe (r/h-} Mpe)?” 


It is interesting that such a fairly elementary calculation leads to an ac- 
ceptable result regarding the redshift for the formation of first structures. 
If one considers small scale halos (about a few kpc), the formation redshift 
can go up to, say, 20. This calculation also introduces the notion of hierar- 
chical clustering in which smaller scales go non-linear and virialise earlier 
on and the merging of these smaller structures leads to hierarchically bigger 





(1+z) 5.8 ( (6.68) 
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and bigger structures. Of course, the process is supplemented by the larger 
scales going non-linear by themselves but the importance of mergers cannot 
be ignored in galaxy formation scenarios. 

` The description so far concentrated on a single spherical patch in the 
universe which was overdense. In practice, we are more interested in de- 
scribing the statistical properties of the structures in the universe rather in 
the dynamics of individual patches. At any given moment of time, the ac- 
tual density of matter p(t, x) will differ from the averaged out density peg (t) 
and the density contrast ô(t, x) = (p(t, x) — Ptg)/Pbg provides a description 
of the inhomogeneous universe. To switch from this to a statistical descrip- 
tion, it is better to concentrate on the spatial Fourier transform 6,(t) of 
the density contrast 6(t,x). We will assume that the density fluctuation 
dx (t) is just one particular realization of a random processes. Then, one 
can define the power spectrum of fluctuations at a given wavenumber k 
by P(k,t) =< |dx(t)|? > where the averaging symbol denotes that we are 
treating P(k, t) as a statistical quantity averaged over an ensemble of possi- 
bilities; statistical isotropy of the universe implies that this power spectrum 
can only depend on the magnitude |k| of the wave number. This quantity 
provides a suitable statistical measure of the fluctuations in the universe 
which we shall use extensively. The power per logarithmic band in k will 
be given by 

Aj (t) = Kè (lök (t)? = K*P(k, t) (6.69) 
Qn? 27? 

(Quite often we will just use P(k, t) = |6,(t)|? without bothering to show 
the averaging symbol (---) explicitly.) For smoothly varying power spec- 
trum, this quantity is related to the mean square fluctuation in density (or 
mass) at the scale R ~ k~! in the universe by 


2 2 
ee (2) Gs (Sr) ~ 92(R,t). (6.70) 
P / Rak- M J Rak 


If we approximate the power spectrum of fluctuations in the universe by 
a power law in k locally then P(k) « k” at any given time. When the 
perturbation is small, (6 < 1) we can use the result derived above, ô « a, 
to obtain 


A2(t) x a2k"t8; — o?(R, t) x a? R-(+3) (6.71) 


as long as ø < 1 with n being a slowly varying function of scale k or R. 
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When o(f,t) — 1 that particular scale characterized by R will go non 
linear and matter at that scale will collapse and form a bound structure. 
Since this occurs when the density contrast o reaches some critical value 
e & 1, it follows from Eq. (6.71) that the scale which goes non linear at any 
given time t in the past (corresponding to a redshift z) obeys the relation 


Ryu (t) x a(t)?/+9 = Ry (to)(1 + 2)77/ 0+3, (6.72) 


Equivalently, structures with mass M œ R3,, will form at a redshift z where 


Myx (z) = Myx (to)(1 + z)°/*), (6.73) 
3 
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Fig. 6.4 The fraction of the critical density N(M) contributed by collapsed halos with 
mass M, plotted against the density contrast at that mass scale, at different redshifts. 
Given the abundance of structures at different redshifts, one can constrain the o at that 
mass scale. The abundance of massive clusters is over plotted on the z = 0 curve to 
illustrate this procedure. 


It is, in fact, possible to work out many properties of collapsed struc- 
tures from the above formalism. To do this, let us consider a spherical 
region of radius R which had very small mean fluctuation 6; at some ini- 
tial time t = t;. The o(M,t;) in Eq. (6.71) denotes the typical value of 
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density fluctuations which exists in the universe at a time t;. In a statis- 
tical description, we could think of regions having a density contrast, say, 
v times larger than the typical value o(M,t) occurring with a probability 
P(v) which — in turn — could be approximated as a Gaussian of unit 
variance in most models at early times. Later on, the fluctuation grows, 
turns around, virialises and forms a bound structure. We know that this 
happens when the density contrast, computed using linear theory, becomes 
e © 1.68. Let 4;(t,t;) be the density contrast needed at time t; so that 
ô = ôe = 1.68 by the time t. For the Q = 1, matter dominated universe, 
6;(t,t;) = 60(t;/t)?/3. As a first approximation, we may assume that a 
region with 6 > 6; (t,t;) (when smoothed on the scale R at time t;) will 
form a gravitationally bound object with mass M œ pR? by the time t. 
Therefore, the fraction of bound objects with mass greater than M will be 


foe) 


FRCS AE ay 6: f P(6, R, ti)dô (6.74) 
ôi(t,ti) 


x ow |. ex (-sama) dé x erfe (5) 
Van o(R, ti) Js, (t,t) p 20?(R, ti) V2o(R, ti) 


where erfc(x) is the complementary error function. This expression can be 
written in a more convenient form using 


ôilt ti) _ Selti/t) 3 be 6-(1 + z) 


o(R,ti)  o(R,tj)  o1(R,t)  oo(M) (6.75) 


where o1(R,t) = o(R,t;)(t/t;)?/7is the linear density contrast at t and 
d0(M) is its value at present t = to. We can express it either in terms of 
the mass M = (47/3)R3 or in terms of R depending on the convenience. 
The overall normalization of fc needs to be done by hand, so that the 
integral of feou(M) over all M is unity. (There are some subtleties about 
normalizing this expression, which we shall not go into.) Then we get, in 
terms of redshift, 


ost) | (6.76) 


feon (> M, z) = erfc EST 
This magical formula (called the Press-Schechter formula) gives you non- 
linear results for the price of linear effort! Differentiating Eq. (6.76), one 
can compute the number of halos with mass between M and M + dM as 
well as the fraction of the critical density Q(M) contributed by halos in 
this mass range. You can easily convince yourself that Q(M) has a fairly 
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simple expression (since you are now differentiating what you originally got 


by integrating a Gaussian!): 
{2 y? 
= == 77 
mY exp ( 5 ) (6.77) 


where v = 6,(1+ z)/o(M). If you want the baryon density in the collapsed 
halos, you only need to multiply this expression by Q3/OQnr. Figure 6.4 
gives 0(M) for a more realistic universe with Qur = 0.35, Qy = 0.65 with 
a more precise linear evolution numerically coded in. 

An interesting application of this result is to constrain cosmological 
models by the known abundance of classes of objects like e.g. galaxy clus- 
ters. From the fact that typical cluster has a mass of about 5 x 10!4Mo 
and an abundance of about 4 x 107%h3 Mpc~3, one can work out the Q 
contributed by the clusters to be about 8 x 107%. Because of various un- 
certainties in these measurements, this could vary anywhere between, say 
8 x 1074 to 2.4 x 107%, that is, by a factor 30. This range of Q at z = 0 is 
also shown in the Fig. 6.4 by a thick line superposed on the z = 0 curve. 
You will notice that, because of the steep rise of the z = 0 curve, the ob- 
served range of Q maps to a very tiny range in ø = (0.5 — 0.8) ! In other 
words, at the scale corresponding to the mass of the clusters, the density 
contrast must be in this range for consistency with observations. Since a 
mass of 10'4 Mo corresponds to a radius of about 8 h~! Mpc with the cur- 
rent cosmological density, one often states this result by saying that at the 
length scale of 8 h7! Mpc the density contrast should be in the range 0.5 to 
0.8 — or, more simply, that og = (0.5 — 0.8). You will hear a lot about og 
in the study of structure formation since it is a fairly powerful constraint. 


ding 
dln M 





Q(M) = 








6.5 Generation of initial perturbations 


For the cosmological paradigm described in the previous sections to work, 
we need to have a mechanism to generate small perturbations in the early 
universe. This turns out to be more non trivial than one would have at 
first imagined, because of the following reason. 

Consider a perturbation at a proper wavelength À which will expand 
with the universe as A(t) « a(t) = Aoa(t) where Ao is the wavelength at the 
present epoch at which a = ag = 1. We need to compare this length scale 
— which characterizes the inhomogeneity — with the characteristic length 
scale for the expansion of the universe. The latter, of course, is the Hubble 
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radius, given by dy (t) = (@/a)~!. If the universe is matter or radiation 
dominated, then a(t) « t” with n < 1. In this case, A(t) œ t” will be bigger 
than dy(t) œ t at sufficiently small t. 

This result leads to a major difficulty in conventional cosmology. Nor- 
mal physical processes can act coherently only over length scales smaller 
than the Hubble radius. Thus any physical process leading to density per- 
turbations at some early epoch, t = t;, could only have operated at scales 
smaller than dy(t;). But most of the relevant astrophysical scales (corre- 
sponding to clusters, groups, galaxies, etc.) were much bigger than dy(t) 
at sufficiently early epochs. Thus, if we want the seed perturbations to have 
originated in the early universe, then it is difficult to understand how any 
physical process could have generated them. To tackle this difficulty, we 
must arrange matters such that at sufficiently small t, we have A(t) < dy(t). 
If this can be done, then physical processes can lead to an initial density 
perturbation. 

We will first show that dy(t) will rise faster than a(t) as t — 0 if the 
source of expansion of the universe has positive energy density and positive 
pressure. Thus the above problem cannot be tackled in any Friedmann 
model with p > 0 and p > 0 at all times. To see this, we begin by noting 
that as t — 0, we can ignore the k/a? term in Eq. (6.2) and write 











2 3 

dB = 5 l. 6.78 

H a?  8rGp (Gu) 
So 

dindy ldinp 3 p\ dina 

Se = =-{14++ 6.79 

dt 2 dt 2 ( z) dt (6:79) 

where we have used the relation d(pa?}) = —pda?. The proper wavelengths 


scale as A x a(t) giving d(InA) = d(Ina). Combining with Eq. (6.79), we 
get 





dindy 3f, p 
=n =]. 6.80 
dink 2 (1+ 2) (9:09) 


For (p/p) > 0, the right hand side is greater than unity. So dy grows as 
Aà? with n > 1 during the early epochs of evolution. Of course, n itself 
could vary with time if (p/p) is not a constant; but we will always have 
the condition n > 1. It follows that (dy/A) < 1 for sufficiently small a 
proving that the Hubble radius will be smaller than the wavelength of the 
perturbation at sufficiently early epochs (as a — 0), provided (p/p) > 0. 
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Hence, in order to generate density perturbations from a region which was 
smaller than Hubble radius, it is necessary to have models with either p 
or p negative. But to satisfy the Friedmann equation with k = 0 we need 
p > 0; thus the only models in which we can have dy > A for small t are 
the ones for which p < 0. 

One simple way of achieving this is to make a(t) increase rapidly with 
t (for example, exponentially or as a œ t” with n > 1) for a brief period 
of time. Such a rapid growth is called inflation. In conventional models of 
inflation, the energy density during the inflationary phase is provided by a 
scalar field with a potential V(¢). If the potential energy dominates over 
the kinetic energy, such a scalar field can act like an ideal fluid with the 
equation of state p = —p. Since the field is assumed to be inherently quan- 
tum mechanical, it will also have characteristic quantum fluctuations. It is 
possible for these quantum fluctuations to eventually manifest as classical 
density perturbations. The situation is illustrated schematically in Fig. 6.5 
which shows the Hubble radius dy = (a/a)~! by a thick line and the wave- 
length of a given perturbation A(t) « a(t) by a thin line. If the inflation is 
exponential with a(t) x exp(Ht), then the wavelength of the perturbation 
increases exponentially during the inflationary phase while the Hubble ra- 
dius remains constant. This allows the wavelength of the perturbation to 
start with a size which is smaller than the Hubble radius and grow to a 
value that is bigger than the Hubble radius during the inflationary phase 
(the wavelength will be equal to the Hubble radius at the point marked A 
in the figure; one usually says that the perturbation ‘leaves’ the horizon at 
A). After the inflation ends — at the epoch marked by a broken vertical 
line — the Hubble radius dy(t) œ t « a? grows faster than the wavelength 
à xa and catches up with it. That is, the perturbation again ‘enters’ the 
Hubble radius at B. The idea now is that quantum fluctuations of a field 
at scales smaller than the Hubble radius at A can lead to classical energy 
density fluctuations at B. 

To do this, we have to discuss little bit of field theory in the Friedman 
universe. We will do this in two easy steps, in case you are not familiar with 
it. First, recall that in classical mechanics, the dynamical equation for a 
particle moving in a potential is given by g(t) = —OV/0q. When we proceed 
from a point particle (for which the dynamical variable q(t) depends only 
on time t) to a field $(¢,x) (which depends on both space and time), the 
time derivatives will be supplemented by spatial derivatives, with the usual 
minus sign in special relativity. Thus, in special relativity, we will have the 
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Fig. 6.5 The behaviour of Hubble radius (thick lines) and the wavelength of a pertur- 
bation (thin line) in an inflationary universe. 


field equation for a scalar field: 
94 


— 2 = 
aps Ot? r 


ƏV 
-—. 6.81 
I (6.81) 
(We are using units with c = 1). Next we need to generalise this equation 
from flat spacetime to FRW background. This is easy if we remember that, 
when we proceed from Cartesian coordinates to spherical coordinates, we 
just use the replacement 
bd Z Zo 1 ô ap OG 


aa? * dy o yg aaa V9” Baa) 882) 


where gag is the three dimensional metric with a, = 1,2,3 and g is the 
determinant of gag. (We are also using the convention that any index 
that is repeated in a term is summed over its range of values.) The same 
applies in four dimensions (with the indices now running over 0, 1, 2, 3) 
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and Eq. (6.81) becomes, in the k = 0 FRW universe: 


1 8 sô% 


2 = —— 
ayes - 5 V4 l (6.83) 


The a? factors in the first term comes from \/—g = a3; the V? gets changed 
to a`? V? because of the expansion of spatial scales. This is what we will 
work with. 

Consider a scalar field (t,x) which is nearly homogeneous in the sense 
that we can write o(t,x) = o(t) + d¢(t,x) with ô < ¢. We will think 
of d(t) as the mean, homogeneous, part of the scalar field and ĝġ as due 
to quantum fluctuations around the mean value. Let us first ignore the 
fluctuations and consider how one can use the mean value to drive a rapid 
expansion of the universe. Since ¢ is independent of spatial coordinates, 
the the equation of motion Eq. (6.83) reduces to 


1d ( 3d¢ _wW 
TIG %)- b+anb=-Z (6.84) 


It is easy to verify that this equation is the same as d(pga*) = —p¢da’, if 
we attribute to the scalar field the energy density pg and pressure pg given 
by 


po = zg +V(¢); P= ig -V(¢) (6.85) 


where V(q) is the potential for the scalar field (There are more formal ways 
of getting this result from the energy momentum tensor but this will do for 
our purpose). The Einstein’s equation (for the k = 0 Friedmann model) 
with the above source can be written in the form 


a= H(t) = 8rG z] 


V(¢) + 54. (6.86) 


In principle, Eq. (6.86) and Eq. (6.84), can be integrated to give a(t) 
and ¢(t) once the form of V(¢) is given. The actual nature of solutions, 
of course, will depend critically on the form of V(¢) as well as the initial 
conditions. Among these solutions, there exists a subset in which a(t) is 
a rapidly growing function of t, either exponentially or as a power law 
a(t) x ¢* with an arbitrarily large value of a. We shall now provide a 
general procedure for determining a form of V (¢) such that any reasonable 
evolution characterized by a function a(t) can be reproduced using a scalar 
field. 
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It is fairly easy to verify that the solutions to Eqs. (6.86), (6.84) can be 
expressed in the parametric form 


A x 1/2 
t+ an|; nE J dt 725 (6.87) 


Given an a(t) these equations give V(t) and ¢(t) and thus determine V(¢) 
implicitly. Thus Eqs. (6.87) completely solves the problem of finding a 
potential V(¢) which will lead to a given a(t). As an example of using 
Eq. (6.87), let us design a universe in which a(t) = agt*. Elementary 
algebra using Eq. (6.87) now gives the potential to be of the form 


V(¢) = Yoexp (-y nt s) (6.88) 


where Vo and a are constants. The corresponding evolution of ¢(t) is given 
by 


3H? 


WO) = arā 








_ {a \i2 87GVo 
elt) = ae) i ( a(3a — 1) :) ' (6:89) 
A second example, consider an evolution of the form 
B 


a(t) x exp(Atf), f O0<f<1, A>0. (6.90) 


= T+" 


In this case, we can determine the potential to be 


V($) « 6-8 (1 eae 5) (6.91) 
48nG ¢? 

where @ is a constant. Both these examples show that it is indeed possible 

to have rapid growth of a(t) for a suitable choice for the potential. We shall 

next enquire about the general conditions V(¢) should satisfy in order to 

lead to this behaviour. 

One possible way of achieving this is through potentials which allow 
what is known as slow roll-over. Such potentials have a gently decreasing 
form for V(@) for a range of values for ¢ during which ¢(t) evolves very 
slowly. Assuming a sufficiently slow evolution of ¢(t) we can ignore the 
ġ term in Eq. (6.84). Similarly, we can ignore the kinetic energy term $? 
in comparison with the potential energy V(¢) in Eq. (6.86). In this limit, 
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Eq. (6.86) and Eq. (6.84) become 


87GV (¢) _ dv 
2 Br = T de’ 
Let us assume that the slow roll-over approximation is valid until a time 
t = tena when ¢ = pena. The amount of inflation can be characterized 
by the ratio a(tena)/a(t); a large value for this number indicates that the 
universe expands by a large factor from the time t till the time teng. Since 
this is a large number in most models, it is conventional to work with the 
quantity N(t) = In[a(tena/a(t)]. From Eq. (6.92) it is easy to obtain an 
expression for N in terms of the potential V(¢); 


H? ~ ; 8Hd¢~-V'(¢) (6.92) 


t tend $ V 
Nain Send) [M uates] Vag. (6.93) 
a(t) t Pena V 
This provides a general procedure for generating and quantifying the rapid 
growth of a(t) arising from a given potential. The validity of slow roll over, 
in turn, requires the following two parameters to be sufficiently small: 


oe hae 1 v" 

e(d) = TénG ($) > nd)= mG Ve (6.94) 
Of these, € has to be small in order to ignore the kinetic energy in compar- 
ison with the potential energy, while 7 has to be small in order to ignore 
the acceleration of the scalar field. The end point for inflation can be taken 
to be the epoch at which € becomes comparable to unity. 

We now turn to the inhomogeneous part d¢@(t, x) of the scalar field and 
consider the spectrum of density perturbations which are generated from 
the quantum fluctuations of the scalar field. This requires the study of 
quantum field theory in a time dependent background which is quite non- 
trivial. We shall, therefore, adopt a heuristic approach, as follows: 

Let us assume that, during the inflationary phase, the universe expands 
exponentially as a(t) «x exp(Ht). Then all the frequencies will be redshifted 
exponentially, w(t) x exp(—Ht). We saw in Sec. 5.4 that such a situation 
is equivalent to the existence of a temperature [see Eq. (5.54)]. This shows 
that quantum fluctuations in an exponentially expanding universe behave 
like thermal fluctuations at temperature T = (H/2r) — just as in the case 
of black holes. (The analysis is identical to that which led to Eq. (5.54) 
from Eq. (5.51) with H replacing «.) Hence the scalar field will have an 
intrinsic rms fluctuation 6¢ ~ T = (H/2r) in the inflationary phase at the 
scale of the Hubble radius. This will cause a time shift ôt ~ 5¢/¢ in the 
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evolution of the field between patches of the universe of size about H71. 
This, in turn, will lead to an rms fluctuation A = (k?P)}/? of amplitude 


. 2 
ae (5) aao (6.95) 
a a 27h 
at the Hubble scale. Since the wavelength of the perturbation is equal to 
Hubble radius at A (see Fig. 6.5), we conclude that the rms amplitude of 
the perturbation when it leaves the Hubble radius is: A4 ~ H?/(27@). 
Between A and B (in Fig. 6.5) the wavelength of the perturbation is 
bigger than the Hubble radius and one can show that Ay = A(at A) x% 
A(at B). To see this, let us consider the density fluctuations at A, B and 
at C which is the epoch at which the universe makes the transition from 
the inflationary phase to the radiation dominated phase. Using Eq. (6.53), 
which gives 6 « (pa?)~!, we know that p = constant and 6 « a~? during 
the inflationary phase, while p œx a~* and 6 œ a? during the radiation 
dominated phase. This gives 


Bs 


c _ ay 6p a 
OA a2,’ ôc a 


(6.96) 


Qn 


Further, we know that (ac/ag) = (tc/tp)!/? = (Hp/Hc)!/* = 
(Hp/Ha)'/?. The first equality follows from a œ t!/? in radiation dom- 
inated phase and the last equality follows from the fact that Ho = Ha. 
Therefore, we get 


az, = ab (2) (6.97) 


Combining the two relations in Eq. (6.96) and using Eq. (6.97) we get 


(=) _ AIB _ 4AB Ha _ aaHa _ r (6.98) 
A 


a2, a Hg apHg 
The last result follows from the fact that the points A and B are defined 
as the exit and entry at which (27/k)a = H~' making aH = (k/2r) = 


constant at these two epochs, implying a4H, = agHg. We thus conclude 
that 








H? 
A(at B) = A(at A) = —. (6.99) 

20g 
Since this expression is independent of k, all perturbations will enter 
the Hubble radius with constant power per decade. That is A*(k,a) œ 
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k? P(k, a) is independent of k when evaluated at a = Genter (k) for the rele- 
vant mode. Observations suggest (see Sec. 6.6) that Ag ~ 1075. 

From this, one can show that P(k,a) « k at constant a. To see this, 
note that if P œ k”, then the power per logarithmic band of wave numbers 
is A? œ k?P(k) œ k”+3, Further, when the wavelength of the mode is 
larger than the Hubble radius, during the radiation dominated phase, the 
perturbation grows as 6 œ a? making A? œ atk("+5), The epoch Genter 
at which a mode enters the Hubble radius is determined by the relation 
2rdenter/k = dy. Using dy œ t x a? in the radiation dominated phase, we 
get Genter X k7} so that 


A?(k, Genter) X afpterk "t œ k7D, (6.100) 


So if the power A? (k, Genter) per octave in k is independent of scale k, at 
the time of entering the Hubble radius, then n = 1. In fact, a prediction 
that the initial fluctuation spectrum will have a power spectrum P = Ak” 
with n = 1 was made by Harrison and Zeldovich, years before inflationary 
paradigm, based on general arguments of scale invariance. Inflation is one 
possible mechanism for realising the scale invariance. 

As a specific example, consider the case of V(¢) = Ad*, for which 
Eq.(6.93) gives N = H?/2)¢? and the amplitude of the perturbations at 
Hubble scale is: 

2 
eae 3H? œ Vales N3/2, (6.101) 
$ v' 
If we want inflation to last for reasonable amount of time (N > 60, say) 
and demand that A œ~ 1075 (as determined from CMBR temperature 
anisotropies; see Sec. 6.6) then we require A < 10715. This has been a 
serious problem in virtually any reasonable model of inflation: The param- 
eters in the potential need to be extremely fine tuned to match observations. 

The qualitative reason for this result is the following: The fluctuations 
arises from (6a/a) = Hét ~ H(6¢/¢). To obtain slow roll-over and suffi- 
cient inflation we need to keep ¢ small; this leads to an increase in the value 
of 6. The difficulty could have been avoided if it were possible to keep 5¢ 
arbitrarily small; unfortunately, the inflationary phase induces a fluctuation 
of about (H/27) on any quantum field due to the non zero temperature. 
This lower bound prevents us from getting acceptable values for 5 unless 
we fine-tune the dimensionless parameters of V(¢). 

At the end of inflation, one expects the energy in the scalar field trans- 
fered to other matter fields, though the details of this process are somewhat 
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model dependent. By and large, this leads to the same relative changes in 
the number density of all species leaving the relative abundances invari- 
ant. In particular, the photon-to-baryon ratio (n,/ng) does not change. 
Since ny « T? x entropy of radiation, the condition 6(n.,/ng) = 0 leads to 
the entropy per baryon remaining the same. Such fluctuations are called 
isoentropic or adiabatic. 


6.6 Temperature anisotropies in the CMBR 


We have seen in Sec. 6.3 that the photons in the universe decoupled from 
matter at a redshift of about 10°. These photons have been propagat- 
ing freely in spacetime since then and can be detected today. In an ideal 
Friedmann universe, a comoving observer will see these photons as a black- 
body spectrum at some temperature To. The deviations in the metric from 
that of the Friedmann universe, motion of the observer with respect to 
the comoving frame, the astrophysical processes which take place along the 
trajectory of the photon, can all lead to potentially observable effects in 
this radiation. These effects can be of two kinds: (i) The spectrum may 
not be strictly blackbody; that is, the radiation intensity J(w) at different 
frequencies, coming to us from a particular direction in the sky, may not 
correspond to a Planck spectrum with a single temperature To. (ii) The 
spectrum may be Planckian in any given direction; but the temperature of 
the radiation may be different in different directions; that is To = To(0, >) 
where (0, 6) are the angular coordinates on the sky. 

The first kind of deviation is not detected in the CMBR and there 
exist tight observational bounds on the spectral deviations of CMBR from 
the ideal blackbody form. This (negative) result is useful in constraining 
the models describing structure formation, especially those aspects which 
are related to the intergalactic medium. On the other hand, CMBR does 
show the distortions of the second kind at different angular scales. Direct 
measurements of these distortions, which are currently available at different 
angular scales, turn out to be very restrictive. Because of this reason, we 
shall concentrate on the angular anisotropies. 

A physical process characterized by a length scale L at z = Zdec, will 
subtend an angle 0(L) = [L/da(zdec)] in the sky today where d,(z) is the 
angular diameter distance. We saw in Sec. 6.1 that dA = Tem(z)(1+ z)7! 
where Trem(z) is given by Eq. (6.16). For large z, this reduces to rem(z) © 
2H OR 2 so that, for z > 1, the angular diameter distance becomes 
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da(z) = 2H5'(/Onr z)~!, giving 
a ONR Lz T n 2 1/2 Ao 
UL) & Ce =) = 34.4" (Onrh?) Tee (6.102) 


(It is conventional to quote the numerical values with the length scales 
extrapolated to the present epoch. Thus ào = L(1 + Zdec) = Lzaąec is the 
proper length today which would have been L at the redshift of zaec. We 
will not bother to indicate this fact with a subscript ‘0’ when no confusion 
is likely to arise.) In particular, consider the angle subtended by the region 
which has the size equal to that of the Hubble radius at zgec; that is, we 
take L to be 

diz(zaec) = H7! [z = zaec] = Hp (ONRZaec) 1/7252 (6.103) 


dec 
so that 
à = di (Zdec){1 + zdec] & diz (Zdec)Zdec & Hy (QNRZdec)71/?. (6.104) 
Then Eq. (6.102) gives: 





= 
614 = O(dyz) S 0.87° ( =.) ? we, (6.105) 


Therefore, angular separation of more than one degree in the sky would 
correspond to regions which were bigger than the Hubble radius at the 
time of decoupling. 

Let us now estimate the magnitude of the temperature anisotropy 
(AT/T) that is caused by different processes. We begin with the intrinsic 
anisotropy in the radiation field at large angular scales, 0 > 04. 

If baryons and photons are tightly coupled, then we expect the 
anisotropy in the energy density of the radiation, dr, to be comparable 
to dg. The exact relation between these two depends on the initial process 
which generated the fluctuations. As we mentioned at the end of Sec. 6.5, 
inflationary models generate fluctuations which are isoentropic, in the sense 
that, the photon to baryon ratio (which is proportional to the entropy per 
baryon) is unaffected by the fluctuation. These isoentropic fluctuations are 
characterized by 


0= 8 |] = SR [2z = “ez . (6.106) 
nB NB | SR PB 


Using sr « T? and pr œ T4, it follows that (ŝsr/sr) = (3/4)ôr. There- 
fore, for isoentropic fluctuations, we will have (3/4)ôr = dg. In general, we 
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expect ôg x dpm at large scales in which gravity dominates over pressure 


forces. Hence, 

AT 

(=) x ÔDM. (6.107) 
T int 


To determine the angular scale over which this effect is significant, we 
proceed as follows: Each Fourier mode of dpm(k), labelled by a wave vector 
k, will correspond to a wavelength À œ k~t and will contribute at an angle 
6 given by Eq. (6.102). The mean square fluctuation in temperature will 
therefore scale with the angle @ as 


2 
(F) œ BkP(k) « E PEK? oœ ATBtM o g- Btn) (6.108) 
int 
for 6 > ĝu. If n = 1, which is the preferred choice, then (AT/T) in œ 072. 
Consider next the contribution to (AT/T) from the velocity of the scat- 
terers in the last scattering surface (LSS). If a particular photon which we 
receive was last scattered by a charged particle moving with a velocity v, 
then the photon will suffer a Doppler shift of the order (v/c). This will 
contribute a (AT/T)? ~ (v?/c?). At large scales, (for \ > dy), the velocity 
of the baryons is essentially due to the gravitational force exerted by the 
dark matter potential wells and v = gtgec X tdecV captures the spatial 
variation of v on the LSS. By an analysis similar to the above, we conclude 
that 


2 
ar T dk Epik œ ATO) œx Q-A+") (6.109) 
T J Dpp kêt k 


for 0 > Oy. If n = 1, then (AT/T)Dopp « 807}. 

Finally, let us consider the contribution to (AT/T) from the gravita- 
tional potential along the path of the photons. When the photons travel 
through a region of gravitational potential ¢, they undergo a redshift of 
the order of (¢/c”). The gravitational potential is related to the perturbed 
density contrast by V2¢ = 4nGpa?d. (Note that pô = (dp) is the per- 
turbed density; the a? factor arises in transforming from VŽ to V2 using 
r = ax.) In the Q = 1, matter dominated universe, p x a~? and ô x a 
so that 47Gpa?é is independent of time making the perturbed gravita- 
tional potential independent of time, i.e., ọ(t, x) = ¢(x). We may there- 
fore estimate the temperature anisotropies due to gravitational potential as 
(AT/T) œ (¢/c?) where ¢ is the gravitational potential on the LSS. This 
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contribution to temperature anisotropy is usually called Sachs- Wolfe effect. 
Its angular dependence will be 


AT? 3, P(k) dk qe 





for 6 > 6x. If n = 1, this contribution is a constant. 
Let us next consider the anisotropies at small angular scales 6 < Oy. It 
can be shown by a more detailed analysis that the contribution (AT/T)2w 
varies as 64 and hence is not very significant at small 6. As regards 
(AT/T)?,, and (AT/T)bopp: we should note the following: At small scales, 
the photon-baryon fluid will have strong pressure support and will oscillate 
as an acoustic wave. To study the properties of this acoustic wave and 
its effect on (AT/T), we need to obtain some preliminary results. First 
of all, note that acoustic disturbances in a photon-baryon fluid propagates 
with the speed c, = (Op/Op)'/? = (p/p)'/?. Since p = pr = (1/3)pr, $ = 
ÔR + fp, it follows that 
2_1 1 Ad 1 1 
* 31+(s/R) 31+ (3p8/4pr) 3 


if we ignore the contribution of baryons. Second, we note that the phase of 
the wave will vary as 


c (6.111) 


of = csk & ge (6.112) 
where we have defined a new time coordinate n by dn = dt/a. This follows 
from the fact that the equation to the light ray is given by ds = 0. Given 
all these, you will expect the photon-baryon fluid to oscillate in time as 
cos|(kn/v3)], say. But the CMBR is observed from a fixed surface 7 = Mrec- 
On this surface, the density fluctuations will vary in space in the same 
manner. The crusts and troughs of the wave will now be separated by 
Ak œ (V31/nrec). The corresponding length scale is AA =% (Nrec/ V37). 
This length scale will subtend an angle 6 = (A/D) where D & no — Nrec © 
no is the distance up to the recombination epoch. Using 7 « a!/? in the 
matter dominated phase, we get 





Ad 1 ae 2505 _ 
Gi er, = (14 zrec) 1/2? & 5.8 x 1073 ~ 0.33°. 
D V30 ( ao an! rec) 
(6.113) 


A more exact calculation gives a slightly larger value. This shows that 
we will expect alternating maxima and minima in AT/T in the sky at 
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small scales separated by about a degree or so. There is, however, a minor 
subtlety. Both the density fluctuations and velocity fluctuations will lead 
to these oscillations but they will be opposite in phase in a pure radiation 
fluid. We get a net effect only because of the existence of baryons which 
prevents them from canceling each other completely. The angular scale we 
have computed, however, is unaffected by this effect. 

There is, however, another effect which needs to be taken into account 
at small angular scales. The above analysis assumes that decoupling took 
place at a sharp value of redshift zge,. In reality, the LSS has a finite 
thickness Az 7% 80 (see Sec. 6.3, Eq. (6.50)). Any photon detected today 
has a probability P(z)dz to have been last scattered in the redshift interval 
(z,z+dz). The observed (AT/T) has to be computed as 


(FF) = fo aie if the last } x P(2). (6.114) 


scattering was at z 


We saw in Sec. 6.3 that P(z) is very well approximated by a Gaussian 
peaked at z = Zdec with a width Az ~ 80. This width corresponds to a line 
of sight (comoving) distance of 


Az 


01/2372 
ONR Ždec 


Al=c (=) Az: (1+ Zdec) S H3! z8 (Anrh) 7? Mpc. 


(6.115) 
Hence the anisotropies at length scales smaller than Al will be suppressed 
due to the finite thickness of LSS. This length scale corresponds to the 
angular scale of about faz ~ 3.8 Q2, 

Another effect which operates at small scales is process called photon 
diffusion. As recombination proceeds, the diffusion length approaches infin- 
ity and hence can be important. The actual length scales below which this 
effect can wipe out fluctuations is somewhat model dependent but could be 
about laim ~ 35 Mpc, corresponding to aia ~ 24’. This result can be un- 
derstood as follows: Consider a time interval At in which a photon suffers 
N = (At/I(t)) collisions. Between successive collisions it travels a proper 
distance I(t), or - equivalently - a coordinate distance [I(t)/a(t)]. Because 
of this random walk, it acquires a mean-square coordinate displacement: 

2 
(Az)? =N G) ae Ae): (6.116) 


a} ` I(t)a2 a 


The total mean-square coordinate distance traveled by a typical photon till 
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the time of decoupling is 


tac dt taecl (tdec 
os f I(t) = Š tect ltdec) (facc) 
0 


AO (t) = 5 a2 (tec) (6.117) 


where we have used the fact that I(t) = [on-(t)]"' ~ 1079 cm (1 + 
z)~3(Qgh?)-! and that a œ t?/3. This corresponds to the proper distance 


NBh? 
0.02 





-1/2 
) (h3). 


(6.118) 
The effect of this diffusion is equivalent to multiplying each mode (AT/T); 
in the Fourier space by the Gaussian exp[—k?(Al)?/2] = exp[—(k/kr)?] 
with kp ~ 0.02hMpc"'. 


3 1/2 
lait = Q(tdec) 2 = tant (tas) œ~ 35Mpc ( 


2 2 
(AT/T), (ATI Ting 





2 
(A T/T) obs 


-10 
10 





Fig. 6.6 The angular pattern of CMBR temperature anisotropies due to different phys- 
ical processes are indicated schematically. See text for discussion. 


Taking all these effects into account, we arrive at the final pattern of 
anisotropies which will look like the one in Fig. 6.6. The contribution at 
large angular scales due to intrinsic anisotropy, Doppler effect and Sachs- 
Wolfe effect are shown by the dotted line, dot-dash line and the dashed line 
respectively. The dot-triple dash line indicates the Gaussian cutoff due to 
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the photon diffusion. The final thick line clearly shows the peaks (called 
acoustic peaks) and the suppression at small scale. At large scales, the 
anisotropies are dominated by the Sachs-Wolfe effect and are independent 
of the angular scale. As we move to smaller scales, other contributions add 
up making (AT/T) increase. At scales 0 < beq, the oscillations in 6g and 
v make their presence felt and we see characteristic acoustic peaks. At still 
smaller scales, the effects due to finite thickness of LSS drastically reduce 
the amplitude of fluctuations. 

The fact that several different processes contribute to the structure of 
angular anisotropies makes CMBR a valuable tool for extracting cosmolog- 
ical information. To begin with, the anisotropy at very large scales directly 
probe modes which are bigger than the Hubble radius at the time of de- 
coupling and allows us to directly determine the primordial spectrum. As 
we have seen, the effect of initial fluctuations in the scalar gravitational 
potential leads to a temperature anisotropy with the angular dependence, 
AT/T œ à!” œ 01-7”, In particular, if AT/T is independent of the angu- 
lar scale at large angles, then the spectrum is scale invariant with n = 1 ina 
Q = Onr = 1 universe. The behaviour is different if there is a cosmological 
constant (or if Q < 1), but in a predictable way. Thus, in general, if the 
angular dependence of the spectrum at very large scales is known, one can 
work backwards and determine the initial power spectrum. 

The angles subtended by the acoustic peaks will depend on the geom- 
etry of the universe and provides a reliable procedure for estimating the 
cosmological parameters. Detailed computations show that: (i) The mul- 
tipole index | corresponding to the first acoustic peak has a strong, easily 
observable, dependence on Riot and scales as lp ~ 2209;.1/ 2 if there is no 
dark energy and Qo: = Qur. (ii) But if both non-relativistic matter and 
dark energy is present, with Onr + Opg = 1 and 0.1 S OQnr S 1, then 
the peak has only a very weak dependence on Qne and lp œ= 220044. 
Thus the observed location of the peak (which is around | ~ 220) can be 
used to infer that Qro4 ~ 1. More precisely, the current observations show 
that 0.98 S Qiot S 1.08; combining with h > 0.5, this result implies the 
existence of dark energy. 

The heights of acoustic peaks also contain important information. In 
particular, the height of the first acoustic peak relative to the second one 
depends sensitively on Ng and the current results are consistent with that 
obtained from big bang nucleosynthesis. 
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Exercises 


Exercise 6.1 Spheres and planes: 


The purpose of this exercise is to motivate the form of 3-dimensional space in 
Eq. (6.3) as one which is homogeneous and isotropic. An obvious example of 
space which is homogeneous and isotropic is a plane which has the metric ds? = 
dr? + r?d6? and a second example will be the surface of a sphere with metric 
ds? = da? + sin? ad8?. Show that one can write these metrics together in the 
form ds? = dx?(1 — kz?)~1 + 2?dé” with k = (0,1). Further argue that this 
metric represents a homogeneous and isotropic space even when k = —1. Finally 
generalize the result from two to three dimension and obtain Eq. (6.3). 


Exercise 6.2 Age of the Universe: 


Another quantity of interest in Friedmann universe, which we did not discuss, is 
its age. Since the age of the universe should be larger than the age of any object 
in it, this can be used to constrain cosmological models. Consider the age of a 
universe with Qy + Qnr = 1, ignoring all other components. It is convenient 
to use the variable y = (a/av) where Nya’, = Qnr. (So that y = 1 when the 
matter and cosmological constant contributes equal amounts to energy density.) 
Show that y(t) = sinh?/*(3Hyt/2) where H2 = (8tGpv/3). Also show that the 
product tH(t) is given by 


L pN\1/2 
tH(t) = spe aa (54) | (6.119) 


where f = (1+ y*)~}. Use this to estimate the current age of the universe for 
the two models with (Qnr = 1, Qy = 0) and (Qnr = 0.3, Qv = 0.7). 


Exercise 6.3 Non standard nucleosynthesis: 


Determine whether the primordial nucleosynthesis will produce more (or less) 
helium and deuterium compared to the standard model if: (i) there are many 
more neutrinos than anti neutrinos (or photons) in the universe, (ii) there are 
many more anti neutrinos than neutrinos (or photons) in the universe, (iii) there 
is a significant contribution from gravitational radiation to the energy density of 
the early universe. 


Exercise 6.4 Approximating a sphere: 


Equations (6.59), (6.61) give 6 in terms 5, only in parametric form through the 
angle 0. Try an approximate fit of the form 1+ ô = [1 — (6L/n)|~” with n as a 
free parameter. Determine n so that this relation fits the spherical evolution as 
accurately as possible. Comment on the result. 
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Exercise 6.5 Putting asunder what gravity has put together: 


We saw in chapter 4 that a supernova explosion releases about 105! erg of energy. 
Show that this hydrodynamic energy can, in principle, unbind the baryonic gas in 
the dark matter halos at z > 5. Take the binding energy of baryons to be about 
a factor (QB /Npm) % 0.15 less than the binding energy U computed above. 


Exercise 6.6 Inflation with V(¢) = Ad?: 


It is possible to satisfy the conditions of slow roll over even in fairly simple 
potentials. As an example, let us consider the case of V(¢) = Ad* with ¢(t) 
evolving from some initial value ¢; at a = a; towards ¢ = 0. Show that the slow 
roll over solutions are given by 





= pi exp |- soi, (t- J ; (6.120) 
a = a; exp (i fi-a |- EN 1 \) ; (6.121) 


Determine when the inflation ends and the value of N in such a model. 


Exercise 6.7 Angular diameter distance: 


Show that the angular diameter distance in a universe with Onr + Qy = 1 is 


c dx 


(1+z) 
Gin Onar L (1 — Onn i2: 6.122 
A Ho(1 + z) I [Qnrx3 + (1 = Nnr) ( ) 


Plot this as a function of redshift for different values of Qnr. Use this to estimate 
the angle subtended by the comoving scales 1h~' Mpc, 10h7! Mpc, 100h7! Mpc 
and 3000h“! Mpc at z = 10°. 


Exercise 6.8 Ruling models out: 


Comparing the spectrum of a galaxy at the redshift z = 1.5 with computer 
simulation suggests that it is about 3 Gyr old. Check whether this is consistent 
with a Qur = 0.3, Qy = 0.7,h = 0.7 universe assuming that stars can form only 
after the protogalactic material has turned around, collapsed and virialized. 


Exercise 6.9 Lensing distance: 


While discussing gravitational lenses in chapter 1, we needed the expressions for 
distances between source and the lens ete.. We now know that one can define 
different kinds of distances (angular diameter distance, luminosity distance, ...) 
so one needs to be careful in using the lensing formula in curved spacetime. (a) 
Argue that the correct distance scale to use for gravitational lensing is the angular 
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diameter distance. (b) Show that in the case of a flat universe with Nur = 1 the 
relevant distance between redshifts z; and z; is given by 


1/2 yo ; .)1/2 
Ho (1+ zi)(1 +z)? 


Exercise 6.10 Real life astronomy: CMBR anisotropy: 


A favourite tool of theoreticians is the CMBFAST program which allows 

the computation of temperature anisotropies for different cosmological mod- 
els. This is also an excellent tool to learn how varying different parameters 
changes the pattern of anisotropies. You can download the program from 
http://www.cmbfast.org. Get it and install it properly in your computer. Run it 
and find the temperature anisotropy pattern for the following models: (a) Qy = 0 
with Qy pr varying from 0.2 to 1.0; (b) Qiot = 1 with Qy varying from 0.1 to 0.9. 
(c) Qy = 0.7, Qot = 1, Qgh? = 0.02 with Qg varying from 0.01 to 0.05; of course, 
you need to vary h to maintain the last constraint. (d) Same as in (c) but keep 
h = 0.72 rather than fixing Qgh?. See whether you can understand the varying 
patterns. 


Notes and References 


Modern cosmology is discussed in the textbooks Padmanabhan (2002); Peacock 
(1999) and Dodelson (2003). Figure 6.3 and Fig. 6.6 are adapted from chapters 
5, 6 of Padmanabhan (2002). For a radically different approach to cosmology, 
see Hoyle, Narlikar and Burbridge (2000). 

1. Section 6.1: The composition of the universe described here was obtained 
from several pieces of work. Some of the original references can be found in the 
review articles Padmanabhan (2003) and Padmanabhan (2005). 

2. Section 6.2: For a simple analytical model describing primordial nucle- 
osynthesis, see Esmailzadeh et al. (1991). 

3. Section 6.6: There exists an excellent code called CMBFAST which can 
compute the temperature anisotropies in the microwave radiation for a wide class 
of models and parameters; see Seljak and Zaldarriaga (1996) 


Chapter 7 


Universe at z < 20 


Let us now turn our attention to the formation of galaxies which arises 
from the non linear growth of the density perturbations in normal gaseous 
matter, made of baryons. This is, of course, the key problem in the study 
of structure formation since all structures which we observe directly are 
baryonic and most of them are in the non linear phase of gravitational 
collapse. For the same reason, this is also the area in which progress has 
been slow because of the inherent complexity of gas dynamics and radiative 
processes. We shall, therefore, rely heavily on simple minded approaches 
to understand this phase of the universe. 


7.1. Galaxy formation 


The description of gravitational collapse in the last chapter applies only to 
dark matter which is not directly coupled to radiation. Since there is at least 
six times more dark matter than baryons in the universe, the former defines 
the pattern of gravitational potential wells in which structure formation 
occurs. You saw in Sec. 6.4 that when the dark matter fluctuation at a 
given mass scale M becomes non-linear, collapses and virialises, it leads to 
a formation of a halo of mass M. The resulting dark matter halo could be 
described reasonably accurately as an isothermal sphere (see Sec. 3.2) with 
a given velocity dispersion v? and an asymptotic density distribution p(r) 
given by 
y2 


Plr) = saa (7.1) 


Based on the discussion in the previous chapter (see Eq. (6.76)), we can 
also work out the abundance of dark matter halos at different redshifts. 
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Let us now consider the baryons located in these dark matter halos. The 
baryonic gas can radiate energy, cool and sink to the centers of the dark 
matter halos. When the dark matter halos merge, the baryonic structures 
can survive provided they have condensed to sufficiently small size within 
these halos. This scenario suggests that it may be possible to understand 
the basic features of galaxies, at a broad level, by studying the interplay 
between cooling and gravitational collapse. As you shall see, it is indeed 
possible, among other things, to understand the characteristic sizes and 
masses of the galaxies by such simple arguments. 

To do this, let us consider in some detail how the baryons cool. Colli- 
sional excitation followed by radiative de-excitation allows the gas to lose 
its energy in random motions and thus cool. At the time of recombination, 
the temperature of the universe is kgT =~ 0.3 eV and as the universe ex- 
pands, the temperature decreases further. Since the excitation from n = 1 
to n = 2 state of hydrogen requires about 9 eV, the gas is too cold to be 
excited either by collisions or by radiation. But later on, when the baryons 
undergo nonlinear collapse in the potential wells of dark matter, they ac- 
quire velocities of the order of 100 km s~!. A proton with this velocity 
has an energy of about 50 eV which is more than enough to ionize the 
hydrogen atom in a collision. Once heated to the virial temperature of the 
dark matter potential well, the gas gets ionized and the resulting plasma 
can cool, mainly due to two different processes. radiation emitted during 
recombination of electrons and ions which, if it escapes the plasma, can 
be a source of recombination cooling. From the discussion in chapter 2, 
Sec. 2.5.3 we know that this rate varies as n?T~!/?. The second is the 
bremsstrahlung cooling with an energy loss rate proportional to n?T7?/? 
[see chapter 2, Sec. 2.4.1]. For baryons, virialized inside a dark matter halo 
and heated to the temperature kgT ~ (GMm,/R) which is much higher 
than the ionisation potential, a?m,.c?, bremsstrahlung cooling dominates 
over recombination cooling and so we will concentrate on bremsstrahlung. 
The cooling time for this process is (see Eq. (2.44)): 


5 El ee Nf TN LE = 
soot (ae /atdV) T (me) nad) (me) a8 IP 


which should be compared to the time-scale for gravitational collapse: 


GM -1/2 
tgrav = (=) . (7.3) 


The condition for efficient cooling tcool < tgrav, coupled to kpT ~ 
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GMmz,/R now leads to the constraint R < Rg with 


1/2 
R, = oF ag ie (=) ~ 74 kpe (7.4) 


e 


where ag = (Gm3/ħc) = 6 x 1073 is the gravitational (equivalent of) 
fine structure constant which you first met in chapter 4 in connection with 
the stars. The analysis assumed that kgT > a?m,c?; for R ~ R this 
constraint is equivalent to the condition M > M, with 


1/2 
M, ~ag oð (z) Mp ~3 x 10gm. (7.5) 


This result suggests that systems having a mass of about 3 x 1044 gm and 
radius of about 70 kpc could rapidly cool, fragment and form gravitationally 
bound structures. Most galaxies have masses around this region and this 
is one possible scenario for forming galaxies. It is remarkable that the 
mass and length scales in Eqs. (7.5) and (7.4) describing the galaxies arise 
entirely from the fundamental physical constants. 

Comparison with Eq. (4.7) shows that the number of stars Netar = 
(M,/M,) in a typical galaxy is also be given by the combination of funda- 
mental constants Ngtar = a7/ 2a" me /mp)!/4 ~ 10!?. Typical galaxies 
indeed have about (101! — 10+?) stars though there is a fair amount of spread 
in this number. The mean distance between the stars in a galaxy will be 
dstar © (Reat/Neiar) ~ 1 pe. 

You can also estimate a few more numbers fairly easily. The original 
(maximum) radius of the cooling plasma estimated above in Eq. (7.4) is 
about 50 kpc. After the matter has cooled and contracted, the final radius 
of a galaxy is more like (10 - 20) kpc. For Mg œ 3 x 10*4 gm and R, œ 20 
kpc, the density is pga: ~ 10~-?° gm cm7? which is about 10° times larger 
than the current mean density, pp ~ 1073? gm cm~, of the universe. If we 
assume (based on the discussion of spherical top-hat model in chapter 6, 
Sec. 6.4) that high density regions with Ø >= 100funiy would have collapsed 
to form the galaxies, then the galaxy formation must have taken place when 
the density of the universe was about(10°/100) = 1000 times larger; the 
value of a(t) would have been 10 times smaller and the redshift of the galaxy 
formation should have been zgai < 9. That appears to be reasonable. 

Further, if the protogalactic plasma condensations were almost touching 
each other at the time of formation, these centers (which would have been 
at a separation of about 2 x 74 ~ 150 kpc) would have now moved apart 
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due to cosmic expansion to a distance of 150(1+ zga1) kpc =œ 1500 kpc = 
1.5 Mpc. This is indeed the mean separation between the large galaxies 
today. The nearest galaxy of radius about 10 kpc, at a distance of 1 Mpc, 
would subtend an angle of 6,4) ~ 107? rad = 30’. A galaxy at the Hubble 
distance of about 4000 Mpc will subtend 0.5”. 

One can even venture to do a somewhat more detailed modeling based 
on these ideas. Suppose J is the density, averaged over a radius r, within 
a protogalaxy forming at a redshift z;. Based on the spherical top-hat 
model discussed in the last chapter, we can assume that such a structure 
will form if p 2 fepog(zf) where fe œ% 200 and ppg is the mean density of 
the background universe. This condition translates to 


p 3 vr 87G 2v2 
os ma ne aa e 
poe 4ar3 G 3HGQnr(1 + zf) OQnr (Hor)? (1 + zs) 








where ve is the circular velocity of the isothermal halo. You can now 
determine the redshift at which the mass inside the radius of, say, r = 
(10,30, 100)h-! kpc could have been assembled. Assuming that the cir- 
cular velocity of the protogalaxy is ve ~ 200 km s~!, we find that 
(1+ zf) = (7.6, 3.0, 1.5)Q-'/3 for r = (10,30, 100)! kpc. Among other 
things, this simple calculation shows that the galaxy formation is a fairly 
extended process in time. 

Incidentally, we note that still larger structures than galaxies, called 
galazy clusters, with masses of about 104” gm, radius of about 3 Mpc and 
mean density of 10~*” gm cm~® exist in the universe as gravitationally 
bound systems. Our argument given above shows that the gas in these 
structures could not have yet cooled and will have a virial temperature of 
about T ~ (GMm,/Rkp) ~ 4 x 10" K. This is also seen more directly 
by comparing the age of the universe with the free fall collapse time. The 
cosmic time t and the redshift z can be related to each other once the 
form of a(t) is known. In the case of a matter dominated universe with 
a(t) = (t/to)?/3 = (1+ z)~1, we have 


t(z) = to(1 +2)? = (2/3) H 1 (1+2)-3/? = 6.5h-1(14+z)—3/2 Gyr (7.7) 


while other cosmological models can change this value by a factor of few. 
On the other hand, the free fall time scale under gravity for a system of 
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mass M and initial radius Rinit is given by 


n2 \ 1/2 R. 3/2 M —1/2 
te={—}) REM x05 G 2 ee 
i (5) init Y” \ 100 kpc 102 Mo 


R \3/2 M -1/2 i 
~ 1.6 G ant ea =" 
ia (3 at) (sams) ue) 


The two lines give scalings appropriate for galaxies and clusters. Comparing 
Eq. (7.7) with the two lines of Eq. (7.8) we see that the peak epoch of galaxy 
formation should be in the range z ~ (2—10) depending on the cosmological 
models while cluster formation is probably a still ongoing process. 

Since the process of gravitational instability, leading to the condensation 
of galaxy like objects cannot be 100 per cent efficient, it would leave some 
amount of matter distributed (almost) uniformly in between the galaxies. 
The light from distant galaxies will have to pass through this matter and 
will contain signature of the state of such an intergalactic medium (IGM). 
The photons (with v > vr where hvr = 13.6 eV) produced in the first 
generation objects could cause significant amount of ionisation of the IGM, 
especially the low density regions. When a flux of photons (with v > vr) 
impinge on a gas of neutral hydrogen with number density ny, it will have 
an ionisation optical depth of T = nyot R. Setting r = 1 gives a critical 
column density for ionisation to be Np = ny R = où} ~ 10!” cm? (see 
the discussion after Eq. (2.108)). Regions with hydrogen column density 
greater than 10!” cm~? will appear as patches of neutral regions in the 
ionized plasma of IGM. Such regions can be studied by absorption of light 
from more distant sources, (especially through Lyman alpha absorption 
corresponding to the transition between n = 1 and n = 2 levels) and are 
called Lyman alpha clouds. We will discuss them in detail in Sec. 9.4. 

Let us next consider some aspects of the processes which actually gen- 
erate the luminosity of the galaxies. If a fraction (say, half) of the typical 
binding energy (GM?/R) of an average galaxy is radiated over the free fall 
time, (GM/R?)-*/?, then a simple calculation shows that the luminosity 
is about L ~ 5 x 10°Lo — which is not much. But then, galaxies mostly 
shine through starlight and not by radiating the gravitational potential en- 
ergy. Higher luminosities can be easily achieved by nucleosynthesis which 
produces the stellar light. About 30 MeV of energy is released for every 
baryon which is converted to metals in stellar nucleosynthesis. If the mass 
of metals produced is Mz = ZM,, where M, is the total mass burnt in 
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stars during the protogalactic star bursts, the energy released is 


M. 
Enuci = 30 MeV (=) = 3 x 107? ZM, e. (7.9) 
p 
Taking Z ~ Zo ~ 107? (which is the typical metallicity of a sun-like star) 
we find that (Enuci/M.c?) = AX ~ 3 x 1074. The energy released by this 
process will be 


Enuct ~ Myc? AX ~ 7.2 x 10° erg (sami) Ga) . (7.10) 
© 


If this energy is released in a time scale tg ~ 108 yr, the resulting luminosity 
is about 5 x 10!2Zo and the mean star formation rate would be several 
hundred solar mass per year. Even if the process proceeded on a more 
extended, hierarchical, time scale of 10° yr, the resulting luminosity will 
be comparable to that of a bright galaxy. This suggests that a generic 
protogalaxy will have a luminosity of Lgai ~ (10'° — 10!?)Lo. Taking a 
luminosity distance of about dr, ~ Hj‘'c ~ 2 x 1078 cm, we find that 
the expected bolometric flux will be about (10714 — 10-16) erg cm~? s71. 
Equating this bolometric luminosity to vF, we get the R band magnitude 
to be about 25, which — in more sensible units — is F, ~ 0.1 mJy at 
A = 0.3 mm. 

The above discussion also illustrates the important link between star 
formation, production of metals and production of starlight. These are 
related by: 

dMz dM, dE x dM, dE dMz 

aves” “ave caver ava ave ava 
Thus the density of starlight produced per second and the density of metals 
produced per second are proportional to the star formation rate. These are 
of use in different contexts in the study of structure formation. 

These processes eventually lead to a background of photons in different 
wave bands, usually called Extragalactic Background Light (called EBL) 
which we will discuss briefly. The unit for measuring the vI, of EBL can 
be conveniently taken to be nW m~? ster~! = 10~® ergs s~! cm~? ster! 
multiplying this quantity by (47/c) will give the spectral energy density 
U, = (4r /c)(vI„) with the dimensions of ergs cm~*. Dividing the energy 
density by pec? will give the density parameter Q contributed by the radi- 
ation background; roughly 1 nW m~? ster~! corresponds to Qh2, = 1077. 
The majority of this EBL at UV to IR wavelengths originates from the 
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starlight in the universe. In particular, the far UV (1000 — 1600) A light 
from the galaxies is produced by the same hot massive stars which also 
generate most of the metals. Thus the EBL in this band is a good tracer 
of star formation rate and also traces the enhancement of the metallicity 
in the universe. Observations have converged to a EBL flux in the range 
of (2—5) nW m~ sr“! in this band. (Of this, quasars and active galactic 
nuclei — which we will discuss in chapter 9 — probably contribute less than 
0.2 nW m~? sr! with the remaining amount arising from the galaxies.) 
Comparing the cumulative flux of detected sources in the optical with the 
EBL flux suggests that nearly 50 per cent of the flux arises from unresolved 
sources. These observations also suggest that the energy density does not 
vary too much over the spectral range (0.1 — 10%)y. 

The sources of EBL at optical (3000 A - 12) and IR (1—1000) are more 
complex. The spectral energy density of a conventional stellar population 
peaks around (1 — 1.5) during most of the stellar ages; hence the optical 
EBL will include older populations at low redshifts and younger population 
at high redshifts. More importantly even the normal stellar emission at UV 
- optical bands will indirectly contribute to IR band along the following 
route: During the period of active star formation, most of the light in a 
galaxy will be emitted in the optical and UV bands and the spectrum (if 
not obscured by dust) will be almost flat (J, ~ constant) up to the Lyman 
continuum cut-off at 912 A. This light is associated with the most luminous 
stars which have the shortest life time. The star formation rate of 1Mo 
yr! leads to a luminosity of about 2.2 x 10°Lọo. The presence of dust in 
galaxies lead to re-processing of the optical-UV light into FIR. This will 
lead to correspondingly large amount of FIR radiation from the primeval 
galaxies. i 

If the total bolometric intensity of star light is taken to be about 50 nW 
m`? ster~!, for the sake of estimate, then the corresponding energy density 
is 


U = 2 x 1074(14 Z) erg em7? = 1 x 1078(1 + Z) MeV cm™? (7.12 
g 


where (1 + Z) factor corrects for the cosmological redshift with Z denoting 
the mean epoch of emission. Each baryon that is converted to helium 
releases about 25 MeV of energy; this figure rises to a total of about 30 
MeV per baryon when we take into account the conversion from hydrogen 
to heavy elements. Dividing U by 30 MeV gives the mean baryon number 
density in metals that is involved in the production of background light to 
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be nz % 4 x 1071°(1 + z) cm~’. If we take the baryon number density to 
be ng = 1.1 x 1077 cm™3, then the ratio of mass fraction in heavy elements 
which can produce the observed EBL is Z ~ 4 x 1073(1 + Z). If the bulk 
of radiation was produced at Z = 2 then the corresponding metallicity is 
Z = 0.01. This gives the ball park figure for consistency between production 
of metals and EBL. 


7.2 Star formation history of the universe 


The previous discussion highlights the importance of understanding the 
star formation history of different types of galaxies in order to make useful 
predictions about them. We shall now discuss some aspects of this. 

Observationally the star formation in galaxies is studied by using differ- 
ent diagnostics all of which have different levels of reliability. One procedure 
is to study the UV emission from the galaxies — which — is directly con- 
nected to the formation of metals since the same massive stars which pro- 
duced the bulk of the UV photons also synthesize the metals and disperse 
them through supernova. In contrast, the relation between UV emission 
and total star formation rate is lot more problematic since the low mass 
stars of any standard IMF contains most of the mass while it is the high 
mass stars which produces most of the UV emission and the metals. Never- 
theless, there have been several attempts to connect the luminosity density 
in the high redshift galaxies with the results of low redshift surveys and 
obtain an overall picture of the star formation rate (SFR) as a function 
of redshift. These approaches estimate the amount of stars produced at 
any given epoch from the abundance of UV luminosity of the star forming 
galaxies. A simple phenomenological relationship between star formation 
rate and far UV luminosity, which seems to be consistent with observations, 
is 


SFR(Moyr7!) = 1.4 x 107 Lpuv(erg s7! Hz~*). (7.13) 


This relation applies to continuous star formation with a Salpeter initial 
mass function between (0.1 — 100)Mo. 

The key difficulty in using such a relation arises from unknown amount 
of dust obscuration in the star forming galaxies. If the ISM of primeval 
galaxies contains large amount of dust which absorbs UV radiation and 
converts it into IR radiation, then the estimated star formation rate will be 
lower than the actual rate. In fact, the presence of dust in galaxies can lead 
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to anything between 50 to 80 per cent of UV light being absorbed. There 
is also some uncertainty arising from the fact that stellar IMF is poorly 
known at the low mass end where most of the mass is contained. 

If we assume that the properties of the dust and intrinsic spectra of 
galaxies at high redshifts are similar to those of star burst galaxies in the 
local universe, then the correction is about factor 5 at z = 3. Different 
models and assumptions could vary this correction factor anywhere in the 
range of (2 - 5). This, in turn, suggests that the star formation rate ob- 
served in the universe is consistent with the assumption of near constancy 
at about 10-°8Mo yr 7! Mpc~? in a redshift range of (1 < z < 4.5). One 
convenient way of parameterizing the star formation rate as a function of 
redshift is through a fit 


Ae? 


TB (7.14) 


Psfr = 
with some constants A, B,a,b. Direct translation of UV luminosities sug- 
gests the parameters a = 3.4,b = 3.8,B = 44.7,A = 0.11Mọ Mpc? 
yr~!. Correcting for the dust could change the parameters of the fit to 
a = 2.2,b = 2.2, B = 6, A =0.13Mo Mpc". In the first case, the star for- 
mation decreases exponentially at high redshifts, while in the second case it 
stays nearly constant with redshift. (Of course, these fits are valid only in 
the range z < 5.) Dividing fs, by the critical density pe = 2.8 x 1011k? Mo 
Mpc~?, we get the rate of change of Os, with respect to time. Given the 
SFR as a function of z, one can also obtain the total amount of gas which 
has been processed into stars by some epoch t from the integral 


p(t) = 1 dz (=) a(z). | (7.15) 


A somewhat more relevant quantity is the amount of gas that has been 
converted into stars per Hubble time which is given by (a/å)ġx. This quan- 
tity — which is not constrained to be monotonic, unlike p,(t) — is shown 
in Fig. 7.1 for the two sets of fitting functions, covering an extreme range 
of assumptions regarding the dust. It is likely that the actual value is 
somewhere between these two curves at any given z. 

These curves illustrate several interesting points. First of all, note that 
most of the star formation activity is in the recent past; the peak activity 
over a Hubble time is around z ~ 1.2 (which corresponds to a cosmic time 
of about 3h;5 Gyr). This feature is more emphatically brought out by the 
time scales shown in the frames. The universe spends less time at high 
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Fig. 7.1 Rate of production of stars in the universe. The curves are fits to data based 
on the UV radiation background in the universe. The broken and unbroken lines give the 
amount of star formation within a Hubble time in units of Mọ Mpc~ for two different 
parameterization discussed in the text. (Figure based on data provided by C. C. Steidel.) 


redshifts and treated as a function of time rather than redshift, much of 
the star formation occurs recently. 

Second, one can estimate the total star formation up to any given time 
by integrating the star formation rate. The results could vary by a factor of 
few between the two curves but the typical value at z = 0 is 2, œ 5 x 107% 
which is adequate to account for the entire stellar content in disks and 
spheroids of the present day universe. Half of this value is reached at 
redshift of about z ~ 2 in these models. 

Further, the stars will also produce the metals in the universe and one 
could ask for the amount of metal enrichment at any redshift z due to 
a given star formation rate. From standard stellar evolution theory, one 
could make a rough estimate that the rate of production of metals which 
are ejected to enrich the surrounding medium to be fmetals © 2.4 x 1072p». 
For the SFR given above, the amount of metals produced by z = 2.5 is about 
0.04 Zo which is in rough agreement with the measurement of metallicity 
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in quasar absorption systems at high redshifts. 

This scenario could also account for, quite naturally, the existence of 
the cosmic infrared background radiation (CIBR) mentioned in Sec. 7.1. 
Observations suggest that this background has an intensity of about vI, © 
(24+5) aW m~? sr! at A = 1404m and vi, œ% (15 +5) nW m~? sr7} 
at à = 240um. The intensity is probably about vI, ~ (40+ 5) nW m~? 
sr! in the wavelength band À = (100 — 1000)um. This flux is larger than 
the “optical” background of about vI, ~ 17 nW m~? sr7! obtained by 
counting all the galaxies detected by HST in the band between 0.3 and 3 
um. To see what these numbers imply, let us make some simple estimates 
based on a fiducial star formation scenario. The mass density in baryons 
at a redshift z is: 


_ 3H3(1+2)8 


an Ns ~7 x 10°°(1 + 2)? h3 sMo Mpc™3. (7.16) 


PB 
If a fraction f, of the baryons undergoes processing, either through star 
formation or by gravitational accretion, with a radiative efficiency €, then 
the locally observed energy density of photons will be 


x ce fa 
py T PB (1 as z)4 


NBh? f 25 \f/ € 
= -30 ( BNE \ ( Je -3 
ROR ( 0.05 ) (&) (5) (oon) oe. 


(For a salpeter IMF with lower cut-off of 0.1Mo, one can use the standard 
stellar evolution models to computer €; one gets the efficiency to be about 
e ~ 1073 which is the value used in the above scaling). The intensity 
corresponding to this photon density will be J = (c/4m)(p,c”); equating 
this total energy to vI, we get: 


con (ho) (fe 2.5 € ae 
v I, = 20 ( at) (£) (5) (——-) nW m`? sr™! (7.18) 


which may be compared with the optical-UV background with a bolometric 
intensity of about 


(7.17) 








~ (17 +3) nW m`? sr! (7.19) 
opt 


vi, 





in the range À = (0.1 — 7)um due to distant galaxies. We are not too widely 
off. 
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In this picture, the optical emission arises from quiescent star formation 
in galaxies by intermediate and low mass stars. This is consistent with the 
assumption that 10 percent all baryons are processed in the low mass stars 
at z ~ 1.5. The local density in low mass stars will then be 


pp(stars) ~ 7 x 10! fx Ag ~ 3.4 x 10°9Mo Mpc? (7.20) 


which is consistent with observations based on photometric surveys and 
standard (M/L) ratios. Further, assuming solar metallicity, we can esti- 
mate the local density in metals to be 


Z - 
pz(stars) ~ 1.6 x 10° fx a QgMo Mpc? ~ 7.7 x 106Mo Mpe™’. (7.21) 
© 


The above discussion illustrates the close interplay between star forma- 
tion efficiency (which in turn is determined by the poorly known high z 
IMF), the intensity of extragalactic background radiation and the density 
of low mass stars in local neighbourhood. Any scenario should be able to 
account for all these observations. 


7.3 Intergalactic medium and the Gunn-Peterson effect 


We have seen in chapter 6 that formation of neutral matter and the de- 
coupling of radiation occurred at z ~ 10°. Since it is unlikely that the 
formation of structures at lower redshifts could have been one hundred 
percent efficient, we would expect at least a fraction of the neutral ma- 
terial, especially hydrogen, to remain in the intergalactic medium (IGM) 
with nearly uniform density. This neutral hydrogen could, in principle, be 
detected by examining the spectrum of a distant source, like quasar. (We 
will discuss quasars in detail in chapter 9; right now we are only using 
them as sources of photons produced at high-z.) Neutral hydrogen absorbs 
Lyman-a photons, which are photons of wavelength 1216 A that corre- 
sponds to the energy difference between the n = 1 and the n = 2 states of 
the hydrogen atom. Because of the cosmological redshift, the photons that: 
are absorbed will have a shorter wavelength at the source and the signa- 
ture of the absorption will be seen at longer wavelengths at the observer. 
We, therefore, expect the spectrum of the quasar to show a dip (known as 
Gunn-Peterson effect) at wavelengths on the blue side (shortwards) of the 
Lyman-a emission line if neutral hydrogen is present between the source 
and the observer. 
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The magnitude of this dip depends on the neutral hydrogen density and 
can be calculated using the optical depth for such absorption. The general 
formula for calculating the optical depth 7, of course, is 


rimiz] = fancar= f o(v)n(z) | 


In the case of Lyman-a photon, the absorption cross section is 


=| dz; v = w(1 +z). (7.22) 








o(v) = 





(V — Va), (7.23) 


where f = 0.416 is the oscillator strength and va is the frequency cor- 
responding to the Lyman-a photon (see Eq. (2.102)). The Dirac delta 
function in o(v) allows us to do the integration in Eq. (7.22) easily and we 
get 








mef ) nmh) dy(z) (7.24) 
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Tap(z) = ( 
where 
di(z) = cH *[Onr(1 + 2} + 9y]? (7.25) 


in a cosmology with OnR+Qy = 1. Here z is related to observing frequency 
Vo by (1+ z) = (Va/Vo) = (A/Aq). Clearly, all v > va will contribute to the 
absorption and will lead to an absorption trough in the continuum blueward 
of the Lyman-a emission line from the source. 

To estimate the numbers, let us consider the simple case of Qnr = 
1,Qy = 0 universe. Then we get 


me? f nalz) 3,-1 (QBh?\ nan 3 

Tef nml) z /2 

Tap (Z) meHove (+ 2872 6.6 x 10°A (a) = — (1+2) 
(7.26) 


where we have expressed the answer in terms of the mean density of hydro- 
gen nuclei at a redshift z, given by 





psi) 





H= Eee Y)Qg(1+z)? = (1.6x 1077 cm~?) (=e) (1+2)? (7.27) 
MH 0.02 
(where Y = 0.22 is the helium fraction). Equation Eq. (7.26) shows that 
even if neutral hydrogen is only a small fraction of the total hydrogen 
density, the optical depth should be fairly high. 
No such absorption troughs are seen in the continuum spectra of the 
quasars! The current upper bound at z œ 5 is about tgp < 0.1, implying 
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that (nyr/fiz) < 1076h. Even if 99 per cent of all the baryons are confined 
to bound structures and only 1 per cent remained as smoothly distributed 
component, this result requires the diffuse IGM to be ionized to a level that 
is better than one part in 104. Assuming that the bound on the optical 
depth is typically like r S 0.1 in the redshift range 0 < z < 5, we may write 
the above bound as 


Onr(nH) S 2 x 1077A71(1 + 2) 79/2. (7.28) 


As we shall see, this poses quite a bit of a problem which is still an open 
issue. We shall discuss this in some detail in the next section. 


7.4 Ionisation of IGM 


The Gunn-Peterson effect described above shows that IGM has to be pre- 
dominantly ionized at redshift z < 5. How to do this consistently in a 
structure formation scenario is one of the key open questions in cosmology. 
Roughly speaking, we need to form sufficient amount of structures which 
can emit the required number of UV photons that can ionize the IGM by 
z = 5 or so. This turns out to be not easy. 

One can think of collisional ionisation or photoionisation as two possi- 
ble means of achieving the ionisation. The collisional ionisation requires 
an IGM temperature of (10° — 10°) K or — equivalently — about 25 eV 
per atom. The evolution of stellar population, leading to supernova explo- 
sions (which reheat the ISM and contaminate it by metals), injects both 
mechanical energy as well as UV photons into the surroundings. But dur- 
ing the evolution of a typical stellar population, more energy is injected 
in the form of UV photons than in mechanical form. This is because the 
energy radiated per baryon (in nuclear burning) which eventually leads to 
solar metallicity Zo ~ 0.02, is about 0.007 Zomic? — of which about 
one-third is radiated as UV light. The massive stars, when they eventually 
explode a supernova, inject most of the metals as well as about 105° ergs 
of energy per explosion into the surroundings. For Salpeter IMF, there is 
about one supernova per 150 Mo of baryons in the form of stars. The mass 
fraction in mechanical energy works out to be about 4 x 10~® which is 10 
times lower than the energy released in ionizing photons. This suggests 
that between the two possibilities, the photoionisation could be a major 
effect in the physics of IGM. Hence we shall ignore collisional ionisation 
and concentrate on photoionisation. 
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In the case of photoionisation, the source of photons could be quasars or 
stars with M > 10Mo. We will see below that a high degree of ionisation 
requires an energy input of about 13.6 (1+ t/trec) eV per hydrogen atom 
where trec is the volume averaged hydrogen recombination time scale and 
t is the cosmic time. The factor (t/trec) is large compared to unity even at 
z œ 10 and thus photoionisation requires significantly higher energy than 
just 13.6 eV per hydrogen atom. 

If the mean intensity of ionizing radiation (in units of energy per unit 
area, time, solid angle, frequency interval) is J,, then the photoionisation 
rate per unit volume is given by 


Rawk i, CEO) ay (7.29) 


where the photoionisation cross section for hydrogen is given by 
wry 7/2 
or = 00 (=) s (7.30) 


with oo ~ 8 x 10718 cm? and wy ~ 2 x 101° Hz (see Eq. (2.106) and note 
that the dominant effect is from kgT ~ fw). This ionizing flux has to come 
from a variety of sources at high redshift. We shall see later examples of 
different ionizing sources and their spectrum. Right now, for the sake of 
illustration, we will consider a spectrum of the form 


J) = h2 x 107? erg cm~? s7} Hz™? ster}, (7.31) 


where vy = 3 x 10!° Hz is the ionisation threshold corresponding to À ~ 912 
A. It is assumed that Jo is of order unity. The magnitude of the intensity — 
with Jo ~ 1 — is typically what arises in most models involving star forming 
galaxies or quasars as the source which explains the choice of normalization. 
The v—dependence is more uncertain and is essentially modeled by the 
continuum spectrum of quasars (see chapter 9). The rate of ionisation per 
hydrogen atom due to the flux J, will be 





co 4 a 
R= J ik ordv ~ 3 x 1077 Jos™t. (7.32) 
vI 


In this case, the equilibrium between photoionisation and recombination 
will lead to the condition 


RNH = ANeNnp = an’, (7.33) 
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if Ne = np. In reality we also have to take into account the fact that the 
plasma may not fill space uniformly but could be clumped. If we assume 
that a fraction (1/C) of space is uniformly filled by plasma with the rest of 
the region empty, we need to multiply the right hand side of the Eq. (7.33) 
by a factor C. (Note that, in this case, the mean square density will be 
< n? >= (n?/C) =< n >? C). Then, we obtain 

nH aCny ~ 10-8 Ccmh?(1 + z)’ Gaz) (7.34) 


n R` Jo 104K 


Using the Gunn-Peterson bound on (np/ny) we can translate this equation 
to 
Qy /4 ayi /2 ( T 


1/4 
ea —9/4 
Qiam S so. AF372012 T z) (1 + z) ; (7.35) 


While the above bound by itself is not very restrictive, the detailed mech- 
anism for photoionisation as well as the issue of generating the necessary 
background intensity turns out to be quite non trivial. 

What we are really interested in is a statistical description of the ion- 
isation state of IGM. This can be done in terms of a volume filling factor 
Q for the ionized regions. In the early epochs, Q will be far less than 
unity and the radiation sources will be randomly distributed. The regions 
which are fully ionized (usually called Stromgen spheres) will be isolated 
from each other and every UV photon is absorbed somewhere in the IGM 
leading to a highly inhomogeneous UV radiation field. As time goes on, Q 
increases and percolation of ionized regions occur around Q = 1. Since the 
mean free path of Lyman continuum radiation is much smaller than the 
Hubble radius, the filling factor Q can be easily estimated by the following 
argument. The fact that every ultraviolet photon is either absorbed by a 
newly ionized hydrogen or by a recombining one, implies that Q(t) must be 
equal to the total number of ionizing photons emitted per hydrogen atom 
until that moment t, minus the total number of radiative recombinations 
per atom. That is, we must have 


Q(t) = [ at! Mien) _ fe? at! = = (7.36) 


Differentiating, we get 


dQ — Trion Q 


dt 7 NH trec 








(7.37) 
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which describes statistically the transition of the universe from neutral to 
ionized form. Given a model for ĉion, this equation can be integrated to 
describe the filling factor. Initially, when Q < 1, recombination can be 
neglected and the ionized volume increases at a rate (nion/n#). At late 


times, in the limit of a fast recombining IGM, the asymptotic value of Q 
will be 


Nion 


QS = trec- (7.38) 
NH 





This shows that the volume filling factor is less than the Lyman continuum 
photons emitted per hydrogen atom within one recombination time. In 
other words, of all the photons with v > vy, only a fraction (trec/t) « 1 is 
actually used to ionize the new IGM material. The condition for complete 
reionisation of the IGM will then be 


Niontrec Zz NH (7.39) 


which says that the rate of emission of UV photons must exceed rate of 
recombination. 

If the matter is clumpy, we need to multiply the right hand side by 
a clumping factor C. Assuming that average clumping factor is a good 
descriptor and that trec < t we can use Eq. (7.39) to arrive at a critical rate 
of emission for ionizing photons, Ñ, per unit comoving volume, independent 
of the previous history. This is given by 


-a (C\ (1+z\> (Qay? 

= 51-1 KAW f 

Faye ee s~* Mpc (S) ( F) (Sr) 
(7.40) 


To see the implications of this number, it is useful to determine an effective 
star formation rate which will provide this rate of photon emission. With 
usual IMF, for each 1Mg star, a fraction 0.08 goes into massive stars with 
M > 20Mo5. In these massive stars, at the end of carbon burning, nearly 
half the initial mass is converted into He and C, with a nuclear energy 
generation efficiency of 0.007. Further, about 25 percent of the luminosity 
is in the form of Lyman continuum photons with Æ > 20 eV. Thus, for 
every 1Mo star that is formed per year, we expect an emission rate of 











Moc? 


Nphotons % 0.08 x 0.5 x 0.007 x 0.25 x (som 


) ~ 105? photons st: 
(7.41) 


278 An Invitation to Astrophysics 


Using this, we can convert the critical UV photon flux to an equivalent mini- 
mum star formation rate per unit comoving volume, 6, = (Ne /N photons fesc) 
where fesc is a factor which determines what fraction of the photons escape 
out of the star forming region. Numerically, for Qgh? = 0.02, we have 


5 "A A OB ONL? 
px © (0.12Mo yr~' Mpc7*) (52) (5) E) . (7.42) 


This is a fairly high rate of star formation and the energy injected into 
IGM is comparable to (or even larger than) the energy radiated by quasars 
at the peak of their activity — which raises the question as to whether 
it is indeed possible to provide such a high rate of ionizing photons in 
the universe. It should be stressed that this estimate depends sensitively 
on the value assumed for fesce which denotes the fraction of high energy 
ionizing photons which can escape the star forming regions and reach the 
IGM. Direct observations of nearby galaxies at the present epoch suggest 
that this fraction can be very low — possibly only a few percent. If this 
is indeed true for the high redshift proto galaxies as well, then one would 
require significantly higher star formation rate in order to make this model 
work. i 

A more formal way of doing this calculation is as follows: The galaxy 
survey data for star forming galaxies at 2 < z < 4 provide direct informa- 
tion about the UV flux of these galaxies, since the rest frame UV continuum 
at 1500 Å will be redshifted into the visible band for a source at z = 3. 
There has been several detailed studies of ultra violet luminosity functions 
of Lyman-break galaxies spanning a factor of about 40 in luminosity. Inte- 
grating the luminosity function over L > 10°Lọo and using the fact that — 
for a Salpeter mass function — the luminosity at 1500 A is about 6 times 
the luminosity at 912 A, one can determine the comoving emissivity at the 
ionisation threshold, v = vz, for these galaxies. The resulting emissivity is 
about 


= i + 2) x 1075 erg s7! Hz~* Mpe~* (for z = 3) (7.43) 


(7 +2) x 10h erg s7! Hz~! Mpc? (for z = 4). 


There is, however, still the uncertainty as to what fraction of the photons 
actually escape the galaxy, which need to be addressed. If we assume that 
about 50 per cent of the photons escape (which is far higher than the 
fraction which escapes our galaxy today), then one gets about 1.58 x 105! 
photons s7! Mpc? at z = 4 which is just high enough to provide the 
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necessary ionisation. The situation at higher redshifts, of course, is quite 
uncertain due to lack of direct observational evidence. 


Exercise 


Exercise 7.1 Non conventional CMBR: 


Suppose all the energy in CMBR was produced by previous generation of stars and 
“somehow” thermalised. What would have been the corresponding metallicity of 
the universe? Does the energy density and metallicity make sense? 


Exercise 7.2 Gravity vs nuclear: 


An over dense region in the universe grows by gravitational instability and col- 
lapses to form a virialized halo. Baryons cool in this halo forming a galaxy which 
forms stars that emit radiation. In the entire process, what is the ratio between 
the nuclear energy which is converted to photons and the gravitational energy 
that is emitted as radiation? 


Exercise 7.3 Metallic shine? 


If the average metallicity of the universe is one-tenth the solar metallicity, estimate 


the mean bolometric surface brightness of the sky in magnitude arcsec”. 


Exercise 7.4 Tinkering with IMF: 


Study of population synthesis models suggest that for the Salpeter IMF the UV 
luminosity of a galaxy is related to the star formation rate by Eq. (7.13). A drop 
in the observed value of Luy is what leads to the corresponding drop in SFR 
at z > 1.2. This depends on Eq. (7.13) which in turn depends on Salpeter IMF 
being valid for the range of masses (10 — 125)Mo which is the main source of UV 
luminosity. Assume that the Salpeter IMF (x M~* with a = 2.35) gets modified 
at z > 1.2. Suppose the index a varies from a = 2.35 at z = 1.2 toa = 2.8 at 
z = 4, What will happen to the derived value of star formation rate in this case? 
Can this scenario be constrained in any other manner? 


Exercise 7.5 End result: 


Consider the diffuse photoionised intergalactic medium with an over density of 
5p/p % (5 to 10). Long after reionisation is over, this medium can be described 
by an approximate equation of state with P « p”. Determine n. 


Exercise 7.6 Poping once more: 


Astronomers have postulated Pop III stars which have very low core metallicity 
(say, Z = 107'°) and releases energy through CNO cycle. On the other hand, 
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standard Pop II stars has a metallicity of Z = 1074. If a 15 Mo Pop II star has 
L = 3 x 10*Lo, Tsu © 4 x 10* K, determine the corresponding values for a 15 
Mo Pop III star. What will be its lifetime and emissivity of ionizing photons? 
Can this help in re-ionizing the universe? 


Notes and References 


Section 7.2: The fits to star formation rates are based on Steidel et al. (1999) 
and are discussed in detail in chapter 1 of Padmanabhan (2002) 


Chapter 8 


Normal Galaxies 


The physics of stars, described in chapter 4 was relatively straight forward 
and one could obtain the basic features of stellar structure starting from the 
first principles. Ideally, one would have liked to do the same for the galaxies, 
which constitute the next higher level structures in the universe. This, 
however, turns out to be quite difficult. As we saw in the last two chapters, 
the formation of galaxies involves condensation of gaseous structures within 
the gravitational potential well provided by the dark matter halos. Several 
aspects of this process are still uncertain preventing us from obtaining the 
details of galactic physics from first principles. Hence we have to take a 
more phenomenological approach in this (and the next) chapter. We will 
describe some of the observed features of galaxies and then try to link up 
with the theoretical ideas without attempting to “derive” the properties 
from first principles. As you will see, one can make fair amount of progress 
by this approach. 


8.1 Morphological classification of galaxies 


The morphology of a galaxy — which is the simplest feature that can be 
ascertained from observations — will depend on the ratio between the time 
scale for star formation and the time scale for gravitational collapse. 

If most of the early, population II, stars form over a time scale somewhat 
shorter than the gravitational free fall time, then the protogalactic gas will 
be converted into a collisionless system of stars before the collapse is well 
underway. This will lead to spheroidal morphologies with stars moving in 
orbits with large eccentricity in the mean gravitational field of the galaxy. 
In this case, each galaxy forms in its own environment through an isolated 
collapse. One can also think of a variant of this scenario, in which a large 
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galaxy forms through mergers of a number of protogalactic fragments — but 
in a time scale comparable to the free fall time of the system — accompanied 
by rapid star formation activity. Since the time scales are comparable, these 
two scenarios are hardly distinguishable from each other observationally. 

It is also possible to imagine another extreme scenario in which the 
merger of large number of fragments occur over a time scale comparable 
to the Hubble time ¢(z) ~ H(z)~1 with most of the stars having already 
formed within the merging sub units. This is quite different from the first 
scenario and will, in general, lead to different final state. The gas, settling 
down inside a dark matter potential well, will gradually evolve to the lowest 
energy configuration for a given amount of angular momentum — which will 
be a centrifugally supported, rotating, thin disk. This occurs before most 
of the gas is converted into stars and hence this scenario — with lower 
star formation rate — will lead to disk like systems. At high redshifts, 
when merging was more frequent, the formation of massive galactic disks 
would have been unlikely since they are almost invariably destroyed during 
encounters. As the merging rate decreases, the gradual assembly of disk 
shaped galaxies increase. 

In reality, you will find that most of the galaxies fall somewhere in 
between these two extreme configurations. Most galaxies, when observed 
in visible band, indeed seem to be made of the two components we discussed 
above, loosely called bulge and disc with the understanding that there could 
exist galaxies in which one component may be completely absent. The bulge 
dominates the central portions of the galaxies, is reasonably spherical in 
shape and is supported against self-gravity by the orbital motion of the stars 
in the mean gravitational field. Most of the bulge stars are population II 
and have a large radial component of orbital velocity. The disc, in contrast, 
is a flattened structure in which stars move in almost circular orbits and it 
contains young massive population I stars, interstellar dust and molecular 
clouds. 

Different morphological types of galaxies correspond to combining the 
bulge and disk components in different proportions. If the galaxy has vir- 
tually no disc, it is usually called an elliptical and is classified as E0,E1..... 
where the notation En denotes a shape in which the ratio, (b/a), of major 
to minor axis is given by (b/a) = (1 — n/10). If the bulge-to-disk contri- 
bution is of order unity and the disk is reasonably smooth (with no spiral 
structure etc.) the galaxy is called lenticular and is denoted by SO. In the 
same configuration, if the disk shows additional structure in the form of 
spiral arms, it is called a spiral galaxy and is denoted by Sa. If, further, the 
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central region of spiral galaxy contains a prominent bar-like structure, such 
galaxies are denoted by Sb. Galaxies which do not fall in any of these cate- 
gories are called — rather unimaginatively — as irregulars. This particular 
classification scheme is called the Hubble classification and the EO, E1 etc. 
are said to represent different Hubble types. Historically, there was an idea 
that the sequence E0, El, ... ,S0, Sa/Sb could represent an evolutionary 
sequence. One often finds in literature, references to ‘early’ and ‘late’ type 
galaxies with the convention that EO is the ‘earliest’ and Sb is the ‘latest’. 
(We will occasionally use this terminology with the understanding that it 
has no direct physical basis.) The ellipticals and the bulges of disk sys- 
tems currently contain, in their stellar population, about 63 per cent of the 
baryonic mass that is in the galaxies in the form of stars, stellar remnants 
or gas. The disks contain only a total of about 21 per cent of the baryonic 
mass and the gas in the galaxies comprise of about 15 per cent of the total 
baryonic mass. 

The next relevant question is how stars (as determined by starlight) are 
distributed within a galaxy. Here we are faced with the problem that a 
galaxy does not have as sharp an edge as compared to, say, a star or a 
planet. The best thing you could do is to describe the stellar distribution 
by the surface brightness of the galaxy as seen in the plane of the sky. 
When a galaxy has a bulge and a disk component, we need to specify the 
surface brightness profiles for each of them. We shall begin with the disk 
component which also includes, as a limiting case, the disk galaxies. 


8.1.1 Properties of disk galaxies 


The surface brightness profile of disks has an exponential fall-off, given by, 
E(R) = Xo exp (—R/Ra) (8.1) 


where R is the 2-dimensional radial coordinate in the plane of the disc and 
Xo, Ra are constants. Outside the central region, disk galaxies exhibit this 
exponential luminosity profile though near the center some minor deviations 
are often observed. When seen edge-on, the luminosity of the disk arises 
from a very thin sheet well fitted by the function sech?(z/zo) where z is 
the vertical coordinate normal to the plane of the disk which is assumed 
to be located at z = 0. (For a theoretical motivation for this dependence, 
see exercise 3.3.) Combining these results, we may take the 3-dimensional 
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luminosity distribution of a disk as 
L(R, z) = Lo exp(—R/Ra) sech? (z/zo). (8.2) 


The center brightness of many galaxies are typically in the range of about 
(22 + 1) mag arcsec~? and zo is few hundred parsecs. 

Integrating Eq. (8.1) over all R gives the total luminosity to be Liot = 
Indo R2. The corresponding surface brightness projected on the sky, u, 
is usually measured in V-band mag arcsec™?. If the physical luminosity 
density is © (measured in Lye per square parsec where Lyọ is the V-band 
luminosity of the Sun), then these two quantities are related by 


D = 10/5064- Lyo pe™?. (8.3) 


For example, p = 26.4 V-mag arcsec™? will correspond to 1Lyọ pe~?. The 
easiest way to derive such relations is to note that surface brightness is 
independent of the distance (when cosmological expansion is ignored) and 
calculate both u and © at a convenient distance. We choose a distance 


D= 22 t60)? pc = 206265 pc (8.4) 


at which a length of 1 pc will subtend an angle of 1 arcsec. At this distance, 
the Sun will have the apparent magnitude 





my = Myo + 5log = 4.83 + 5 log(20626.5) = 26.4. (8.5) 


10 pe 
At this convenient distance 1 square arcsec will contain a luminosity L = 
LLyo-(1 pe?) and will have a magnitude my = 26.4—(5/2) logy. Equating 
this to u and solving for © we get the relation given above in Eq. (8.3). 
In terms of u, which is preferred by observers, Eq. (8.1) becomes 


u(r) = po + 1.086 (z=) l (8.6) 


A wide class of spirals has an exponential form for the surface brightness, 
with the central surface brightness ranging from 21.5 to 23 mag arcsec™?. 
Taking a value of uo ~ 21 V-mag arcsec™? and using Eq. (8.3) we get 
Xo = 170Lyọ pe~*. Thus the surface density profile of these spirals can 


be expressed in the form 


E(R) = 170e7F/ Fa Lyg pe-?. (8.7) 
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In our galaxy, the brightness towards the galactic poles above the Sun 
(which is located at a radial distance of R = Ro ~ 8 kpc) is about 24 
V-mag arcsec~*. This implies [through Eq. (8.3)] that U(Ro)/2 = 9Lyo 
pe~? with the factor half arising from the fact that there are two sides to 
the disk of Milky Way. If the exponential law is applicable to the Milky 
Way, then we can immediately determine the scale length using the value 
of (Ro); we get Ra = 3.6 kpc which is consistent with other observations 
as well. 

Let us next consider the central regions of disk galaxies which have been 
investigated extensively because many of the galactic bulges are suspected 
to harbor a massive black hole in the center. The radius of influence, ry, 
of a black hole of mass Mgy on a stellar system with velocity dispersion 
a? can be obtained by equating the gravitational potential energy per unit 
mass (GMpy/r) due to the central black hole to the kinetic energy per unit 
mass (02/2). This gives: 


GMpu Mesu Ox = 
rn = OW) =a (ee) (oom) e 8) 





Stars located at r < ry will be significantly affected by the black hole. 
Unfortunately, this region is rather small to be detectable directly in exter- 
nal galaxies. The angle subtended by this radius, in a galaxy located at a 
distance D from us, is about 


—2 -1 
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A vast majority of galaxies have D > 20 Mpc and ø > 200 km s™!; in 
such galaxies, we cannot detect the effect of a black hole unless it is very 
massive. 

While confirmed detection of the black hole is difficult, one can indirectly 
check for the existence of a black hole (of certain mass Mpy) by studying 
the density profile of stars near the origin. The effect of a black hole on the 
distribution of stars near the central region depends on the gravitational 
two body relaxation time, tr. Let us begin with the situation in which we 
are interested in time scales much shorter than tr; i.e., t < tr with tr 
given by (see Eq. (1.66)) 


3 5 -3 
o 10 {4x 10°Mo pe 
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where we have taken ln A ~ InN = 14,0 œ 150 kms7! and M = 1Mo for 
making the estimate. If the black hole forms by slow accumulation of mass 
over a time scale which is much larger than the orbital time scale (Ro/c) 
of the stars, then the evolution of the distribution function of the stars 
can be estimated in the adiabatic approximation. If the initial distribution 
function is further assumed to be a Maxwellian, then we have 


N 2 
HB) = Grease” (8.11) 
where no is the number density of stars and E = (v?/2) + © is the energy 
per unit mass. As an example of the application of this result, consider the 
density contributed by the stars that eventually become bound to the black 
hole. For these stars, E < g? and we can take f © no/(2m07)°/?.. Then 
the space density of the stars becomes 





2GMpu/r 2 4no TH 3/2 = GMpgu 
n(r) = 4r f PHE) i r= 
(8.12) 
Note that the black hole has introduced a weak cusp (n œ r73/2) in the 
density profile. 

The situation in the case of t >> tg is more complicated. Naive ap- 
plication of Maxwellian distribution in energy will give a density f « 
exp[—®(r)/o?] œ exp[ra/r] which diverges exponentially near the origin. 
This result, of course, is unrealistic because any star having a close en- 
counter with the black hole will be tidally shredded and swallowed by the 
black hole. If these processes lead to a steady state density distribution 
n(r) = no(r/r)§, then equilibrium would require energy balance between 
the potential energy of the stars which are swallowed by the hole and the 
flux of positive energy that must flow out through the cusp. Since v? œ r71, 
the relaxation time will vary as tp œ (v3/n) œ r(°73/2, The number of 
stars, N(r) within a radius r scales as N(r) œ r3~* while the energy of 
a star at a given radius r goes as E(r) œx r~!. Hence the rate of flow of 
energy through a radius r scales as (N(r)E(r)/tr) œ r(7/2-25). If this flow 
has to be independent of radius r, then we must have s = 7/4 which is a 
slightly milder cusp (n œ r~7/4) compared to the one obtained in Eq. (8.12). 
Numerical simulations show that this result is indeed true. 

Thus we can distinguish between two possible limiting behaviours in 
the core region of a galaxy. The first is that of a smooth stellar core with 
constant density and the luminosity profile of the form like, say, I(r) = 
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Io{1 + (r/rc)?]~! (where Ip is the central brightness and re is the core 
radius). The second is a central region with a cusp in the density so that 
the intensity goes as I œ r~* which is indicative of central black hole. 
HST observations seem to suggest that the distribution function of stars 
is indeed cuspy near the central regions of many galaxies, indicating the 
presence of central black holes. 

In many spiral galaxies, the inferred mass of the black hole based on the 
dynamics of stars is found to be well correlated with the absolute blue mag- 
nitude of the bulge. A linear relation exists between In Mpu and the blue 
magnitude Mg bulge which is equivalent to the relation Mgy œ 0.006mbuige 
where bulge is the mass of the bulge. 

The disk galaxies show fair amount of rotation and the study of rota- 
tion velocities as a function of radial distance has been a valuable tool in 
mapping the density distribution in and around the disc. In all the cases, 
the rotation curves (which is just the function v(r)) are either flat or gen- 
tly rising at large radii, as far as the observations go. This is one of the 
key evidences for the existence of dark matter halos around the galaxies. 
The shape of the rotation curve exhibits some interesting correlations with 
other properties of the galaxies. For example, the rotation curves tend to 
rise more rapidly near the center and peak at higher maximum velocity 
(Umax) With increasing B-band luminosity. It is possible to understand the 
origin of such a scaling relation (between the luminosity of the galaxy and 
Umax) under certain conditions. Starting from the result M œ Rv?,,,/G 
and assuming that the (M/L) ratio for disk galaxies is approximately con- 
stant, we get L œ Un RK: Further, if all disk galaxies have approximately 
the same central surface brightness, we would expect (L/R?) to be ap- 
proximately constant. Eliminating R from these two relations, we find 
that the total luminosity of the disk scales with the rotation velocity as 
L œ vi, = Cvs.,, where C is a constant. This is called Tully-Fisher re- 
lation. (The same result is obtained even if central brightness, I(0), is not 
a constant, provided (M/L) œ I(0)-1!/?.) At a more fundamental level, we 
can attempt to relate v and M using the results of Sec. 6.4 for a given fluc- 
tuation spectrum g9(M). For oo(M) « R-(+9)/2 x M-("+3)/6 | we have 
v x (M/R)! x MG-")/12_ At galactic scales the power spectrum has the 
approximate slope, n ~ —2 giving M œ vt. If (M/L) is constant, we again 
get L x vt. Taking the logarithms to obtain the absolute magnitude, we 
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L 
M = Mo — 2.5 log Zr. )= —10 log Vmax + constant. (8.13) 
© 


Observations show that the best fit relation between absolute blue magni- 
tude Mp and Umax is indeed of the form Mg = —alogumax + Ø where 
the pair (a, 8) has the values (9.95, 3.15), (10.2, 2.71), (11.0, 3.31) for Sa, Sb 
and Sc spirals. This is quite close to the relation L œ vé ax obtained above. 


8.1.2 Angular momentum of galaxies 


Another closely related feature of galactic systems, especially the disk like 
systems, is their angular momentum. The angular momentum of a galaxy 
can be conveniently expressed in terms of the dimensionless parameter 
à = w/wo where w is the actual angular velocity of the system and wo 
is the hypothetical angular velocity that is needed to support the system 
against gravity purely by rotation. We know that Mw? R œ GM?/R? while 
w? = J*/M?R* where J is the actual angular momentum of the galaxy. 
Eliminating the radius in terms of the energy by R = GM?/|E| we get 

w JEM? 


Wo GM5/2° 





A 


(8.14) 


It follows that a self-gravitating system with appreciable rotational support 
has a A comparable to unity. Observations suggest that disk galaxies have 
Aà 0.4 to 0.5. 

Of course, we need to understand how the galaxies got this angular 
momentum, parameterized by observed value of A. One possible way of ex- 
plaining the angular momentum of galaxies is as follows: During the initial 
collapse of the baryonic structures, tidal forces will be exerted on each pro- 
togalaxy by its neighbours. The tidal torque can spin up the protogalaxies 
thereby providing them with some angular momentum. During the col- 
lapse of the gas due to cooling, the binding energy increases while mass 
and angular momentum remains the same. This can allow à to increase as 
à x |E|!/? and (possibly) reach observed values. 

In this scenario, protogalaxies can only acquire initial angular momen- 
tum because of tidal torquing due to other collapsing material around it. 
If the typical mass, comoving radius and density contrast are M, R, 6, then 
the tidal acceleration of any one of a pair of blobs is (6GM/a*R%)(aR) = 
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(6GM/a?R?) where we have taken the separation between the blobs to be 
of same order as the size of the blobs and used the expansion factor a to 
convert the comoving radius to proper radius. If the masses involved are 
taken to be M/2 and the lever arm for the force is aR/2, then the net tidal 
torque is of the order of 

6GM M aR GM? 


~ = xX 


a? R2 2 5 2 4Ra ` (8.15) 


In a One = 1 universe, 6 œ a implying that the tidal torque is a constant 
and the angular momentum grows linearly with time: J = Tt. The total 
angular momentum acquired until the time of turn around tta can now be 
estimated for a spherical top hat model. Using the relation between energy 
and virial radius and the details of spherical top hat model, it is easy to 
show that the angular momentum parameter is about À œ 0.0257} if 
the object has an over density which is v times fractional density contrast. 
Once again simple arguments lead to meaningful final results though several 
factors of order unity have been ignored in the analysis. 

Further dissipational collapse of the baryons will increase the value of A, 
especially in the case of disks embedded in spherical halo. In the absence 
of dark matter halos (and assuming that a protogalaxy was just a self- 
gravitating cloud of baryonic gas) it turns out that the idea is not viable. 
But the idea works in the presence of a massive halo. In the presence of a 
massive dark halo, the initial spin parameter of the system, before collapse 
of the gas, can be written as \; = (J|E|!/2/GM*/) where the various 
quantities, J, E and M refer to the combined dark matter - gas system, 
although the contribution from the gas is negligible compared to that of the 
dark matter. After the collapse, the gas becomes self-gravitating and the 
spin parameter of the resulting disk galaxy will be Aa = (Ja|Ea|!/2/GM. a / =) 
where the parameters now refer to the disc. So we find that 


ue a (8.16) 

x (7) (= M) ` 
The energy of the virialized dark matter - gas system, assuming that the 
gas has not yet collapsed, can be written as |E| = kı(GM?/Re), while 
that of the disk is given by |Ea| = ko(GM?/r.). Here Re and ro are the 
characteristic radii associated with the combined system and the disk re- 


spectively, while k1, kg are constants of order unity which depend on the 
precise density profile and geometry of the two systems. The ratio of the 
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binding energy of the collapsed disk to that of the combined system is then 


Mal la (Me)* (22) ae 

E] k \M Re ; i 
Further, the total angular momentum (per unit mass) acquired by the gas, 
destined to form the disc, should be the same as that of the dark matter. 
This is because al] the material in the system experiences the same external 
torques before the gas separates out due to cooling. Assuming that the gas 
conserves its angular momentum during the collapse, we have (Ja/Ma) = 


(J/M). Hence, 
= (9) (8) GE)” 
TORORO ‘se 


where we have used Eq. (8.17) to simplify Eq. (8.16). The gas originally 
occupied the same region as the halo before collapsing and so had a pre 
collapse radius of Re. Hence the collapse factor for the gas is 


-BG e 


We see that the required collapse factor for the gas to attain rotational 
support has been reduced by a factor (Ma/M), from what would have been 
required in the absence of a dominant dark halo. For a typical galaxy with 
a halo which is ten times as massive as the disc, one needs a collapse by 
only a factor of about 40 or so before the gas can spin up sufficiently to 
attain rotational support. This is quite easy to achieve. (In the absence 
of dark matter halo, the factor (Mqa/M) ~ 10 is replaced by unity and one 
requires collapse factors which are ten times larger.) 

It is possible to extend these arguments and estimate the characteristic 
length scale, Ra, of an exponential disk profile exp(—R/ Ra) if we assume 
that there is no exchange of angular momentum between the disk and halo 
as the disk forms. For an exponential disk, you can easily compute the 
angular momentum per unit mass and obtain: J/M = 2Rqgvg where vg is 
the constant rotational velocity of the stars and gas. Combining this with 
the virial theorem |E| = K = (3/2)Mv? (where v? is the velocity dispersion 
of the halo) we can eliminate J and |E| from \ = (JE'/?2/GM®/2) in terms 
of M. Finally, expressing M in terms of the halo rotation velocity by 


Chapter 8 : Normal Galazies 291 


GM/R = v? and assuming that the density was a factor fe larger than 
the background density ppg(z) = 3Nnr(2)H?(2)/8TG at the time of the 
collapse redshift z, we get 


25 -1/2 -1/2 
Ra= =~ (Fh) of | Fomm@er@] ew 


If we now use fe = 180 and assume that when the disk settles down, vg ~ vp, 
we get 
Ho 


il as (8.21) 


Ry 26h! k (zaa) 
i ps 200 km s7! H(z) 


which relates the scale length of the disk Rg to the formation redshift z f 
of the disc. For reasonable scale lengths, disk could not have formed at 
redshifts significantly higher than unity. For Milky Way, with v ~ 220 
km s~!, Rg ~ 3.6 kpc we need Ho/H(zs) = 0.55 corresponding to ape 
0.5. The overall picture which arises from these considerations — that the 
spheroids and bulges form early on while the disks form at a somewhat later 
epoch — is consistent with the observations. It is, for example, known that 
metal poor stars in solar neighbourhood have highly eccentric orbits. This 
will arise naturally if such a spheroidal system of stars have formed early 
in a time short compared to the collapse time and their orbits ended up 
mixing through collisionless relaxation. 


8.1.3 Properties of elliptical galaxies 


Let us next consider the bulge component which has a more extended dis- 
tribution. Observations indicate that the bulge can be modeled by taking 
the fall-off as exp|—(R/Ro)*| with a œ 0.25. Since elliptical galaxies are 
dominated by a bulge component, this distribution also describes the sur- 
face density profiles of ellipticals. The scale length, Ro, is related to the size 
of the elliptical galaxy though defining the size of a galaxy itself is a non 
trivial task. It is conventional in the description of elliptical galaxies to use 
the effective radius, Re, defined such that half the light is enclosed by this 
radius. In terms of Re, the surface brightness of an elliptical galaxy (or, for 
that matter, the bulge component of a disk galaxy) is well-described by 


/ 
I(R) = Ie exp {ot (iz) ' — 1 (8.22) 
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which is called the de Vaucouleurs law. The corresponding result, in units 
of magnitude arcsec~?, is given by 


(8.23) 


R\14 
(R) = He + 8.3268 Iz) —1 





The total luminosity obtained by integration over all R is Lio, = 22.71, R2. 

In general, a galaxy will have both bulge and disk components. A 
typical B-band surface brightness profile of a Sa0 galaxy will be a sum of 
two contributions with different radial dependences adding up to give the 
observed surface brightness. At large radius the brightness profile is well 
fitted by the equation of the form: 


(8.24) 


0 
Hdisc(8) = co + €1 (=) 


where co and cı are constants. This is characteristic of an exponential disc. 
Near the origin, the intensity profile is well fitted by the equation 


6 1/4 
pstee®) = MCO) + c ( 1) (8.25) 


with u(0) and cg being two more constants. This is characteristic of a bulge 
near the center. If we assume this fit to extend all the way to origin, the 
central brightness due to the bulge will be significantly higher than the 
extrapolated central brightness due to the disc. Whenever a particular disk 
galaxy shows a dominant bulge, the intensity distribution of the bulge is 
quite close to de Vaucouleurs law in Eq. (8.22). 

In the simplest approximation, an elliptical galaxy is all bulge and no 
disc. The luminosity profile of the ellipticals are reasonably well fitted by 
the de Vaucouleurs profile in Eq. (8.22) away from the center but the central 
region shows a more interesting structure. Current HST observations show 
that several elliptical galaxies do show a cuspy region with ITar-%,a x 
(0.5 — 0.3) up to the limit of HST resolution. (We saw in Sec. 8.1.1 that the 
existence of a central black hole could lead to such a cusp in the density 
profile.) 

One significant systematic trend exhibited by the elliptical galaxies is 
the existence of what is called a fundamental plane. Observations show 
that the effective radius Re, the central velocity dispersion dọ and the 
mean surface brightness, J, (within the effective radius Re) are connected 
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by a relation of the form 
Re x oft? he. (8.26) 


That is, if the elliptical galaxies are plotted as points in the 
(In Re, Ingo, In Ie) space, they tend to lie on a two dimensional plane rather 
than scatter all over the three dimensional space. The best-fit equation to 
the fundamental plane, in terms of the magnitude pie, is 


log(R.) = 0.34ue + 1.49 log og — 6.60 (8.27) 


where Re is measured in pc, He in B— mag arcsec”? and op in km s7}. Since 
do and je can be measured directly from spectroscopy and photometry, one 
can use this relation to obtain the physical radius Re. Comparison with the 
apparent angular radius in the sky will allow us to determine the distance 
to the galaxy, thereby providing a very valuable observational tool. From 
virial theorem, one would expect 


M L(M IR? (M 


so that R x o?I>1(M/L)~!. If the mass-to-light ratio varies as M/L œ 
M°? x L-25, one gets R œ o4/3J~5/6 which could approximately account 
for the fundamental plane of ellipticals. 

The existence of the fundamental plane also allows you to derive a simple 
relationship between diameters and velocity dispersions of these galaxies. 
Let Dn be the diameter within which the mean surface brightness is Jp = 
20.75B— mag arcsec~? and assume that (i) the intensity profile is given by 
a function of the form I(R) = Ief (R/Re); (ii) most of the light interior to 
Dn, comes from radii at which f(x) ~ x7% with a ~ 1.2. It then follows 
that 


R 2 Dn/2re 
keai (ž) f doz f(x) (8.29) 
Dy, o 


which, for f(x) = 27%, gives D, « reld!® x re198. Using the Eq. (8.26) 
for the fundamental plane, we get 


Dyce te (8.30) 


which is called the Faber-Jackson relation. Observations show that this 
relation holds to a high accuracy. The weak dependence on J, shows that 
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there will be a tight correlation between Dn and ø allowing these relations 
to be used for estimate of distances. 


8.2 Models for stellar distribution in a galaxy 


For time scales large compared to the age of the universe at the epoch of 
formation of the galaxy, one can assume that the galaxy to be in a steady 
state. The stars in such a galaxy will be orbiting in the total gravitational 
potential generated by all the stars in the system. The distribution of 
stars in a galaxy can be described by a function f(x,v,t) which may be 
interpreted as the (relative) probability of finding a star in the phase space 
in the interval (x, x + d°x; v, v + d°v) at time t. The smoothed-out mass 
density of the stars at any point x will be 


p(x,t) =m J fxv, tdv, (8.31) 


where m is the average mass of the stars. Such a smoothed-out density will 
produce a smooth gravitational potential $(x, t) where 


V7o = 4nGp. (8.32) 


We may assume that each star moves in this smooth gravitational field 
along some specific orbit. The conservation of the total number of stars, 
expressed by the equation (df /dt) = 0, will reduce to 


df Of, OF, 


dt ôx Ov Ox 
OF 3f Of _ 
= 3 tya Veg =O: (8.33) 


which is usually called collisionless Boltzmann equation. Models for galax- 
ies are based on the solution to the coupled equations Eq. (8.32) and 
Eq. (8.33). 

The actual gravitational potential at any point x, due to the stars is 


Pact (X, t) aie 2 eo (8.34) 


where x;(t) is the position of the i-th star at time t. This will be different 
from the ¢ in Eq. (8.32) which was produced by the smooth density. Because 
of this difference, the actual trajectories of the stars will differ (appreciably) 
from the orbits of the smooth potential after sufficiently long time intervals. 
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This time scale is precisely the two-body relaxation time scale derived in 
Sec. 1.4; since this time scale is large compared to the age of the universe, 
we can ignore the difference between the exact gravitational potential and 
the one obtained by averaging the mass density. 

While the above equation provides an accurate description of a galaxy 
in steady state (with (Of /Ot) = 0 in Eq. (8.33)), it does not lead to a unique 
form of stellar distribution. This is because one can obtain a wide variety of 
solutions to these equations with different symmetry properties. Thus we 
can, at best, use this equation to describe the properties of a galaxy with a 
known distribution of stars — rather than obtain the result ab initio. We 
shall discuss a few such models in this section. 

When the galaxy is in a steady state, the distribution function does 
not explicitly depend on time and f(t,x,v) = f(x,v). To produce such 
a steady state solution we can proceed as follows: Let C; = Ci(x,v), i = 
1,2,--- be a set of isolating integrals of motion for the stars moving in the 
potential @ (which, right now, is not known). It is obvious that any function 
f(C;) of the Cis will satisfy the steady-state Boltzmann equation; (df /dt) 
= (Of /OC;) C; = 0 since C; is identically zero. If we can now determine ¢ 
from f self consistently, and populate the orbits of @ with stars, we have 
solved the problem. Let us consider some specific examples to see how this 
idea works. 

In these calculations, it is convenient to shift the origin of |¢| by defining 
a new potential Y = —¢+ ĝo where ¢o is a constant. (We will choose the 
value of ġo such that y% vanishes at the ‘boundary’ of the galaxy.) The new 
potential satisfies the equation V?y = —4rGp and the boundary condition 
aw — ġo as |x| — œ. We will also define a “shifted” energy for the stars 
e€ = —E + ġo; since dp = Y + ġ, € = -E + Y + ọ = —(1/2)0? + 4. 

The simplest galactic models are the ones in which f(x, v) depends on 
x and v only through the quantity e so that f = f(e) = f(p — $ v?). The 
density p(x) corresponding to this distribution is 


V2% 1 % 
ie [ 4nv*dvf (Y - 50") = I Andef(e) (Hoo). 835) 
0 0 
The limits of integration are chosen in such a way to pick only the stars 


bound in the galaxy’s potential. The right hand side is a known function 
of w, once f(e) is specified. The Poisson equation 


p 
a (e#) E E E e f defl) VGA (8.36) 
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can now be solved - with some central value ~(0) and the boundary condi- 
tion 7)'(0) = 0 — determining y(r). Once w(r) is known all other variables 
can be computed. 

Three different choices for f(e) have been extensively used in the liter- 
ature to describe the galaxies. We will summarize these models briefly. 


8.2.1 Lane-Emden models 


The simplest form of f(e) is a power law with f(e) = Ae"~3/? for e > 
0 and zero otherwise. Using Eq. (8.35), we see that this corresponds to 
density distributions of the form p = By” (for y > 0) with B = (27)3/? AT 
(n —1/2)[['(n +1)]71!. [Clearly, we need n > 1/2 to obtain finite density]. 
The Poisson equation now becomes 


1d 7 pe) J-€E €20 
paa) l é<o ER 
where we have introduced the variables, 
L = (4nGy(0)""1B)-2; l= (r/L); €= (%/4(0)). (8.38) 


This equation is the Lane-Emden equation which was discussed in Sec. 3.2. 
The case which is of interest in galactic modeling arises for n = 5. In this 
case, Eq. (8.37) has the simple solution 


€=(1+ sy} (8.39) 


which corresponds to a density profile of p œ [1 + (1/3)1?]-5/? and a to- 
tal mass of M = (V3L4(0)/G). This model (called the Plummer model) 
provides a reasonable description of a class of elliptical galaxies. 


8.2.2 Isothermal and King models 


The second choice for distribution function is the one of the form 


f(e) = TA P (5) (8.40) 


parameterized by two constants po and ø. One can easily verify that the 
mean square velocity < v > is 30? and that the density distribution is 
p(r) = po exp(y/o?). The central density is pe = po exp(W(0)/o7). It is 
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conventional to define a core radius and a set of dimensionless variables by 





902 \ r p 
= . l AI r — g . 
g ER , ro’ £ Pe y 


Then the Poisson equation can be rewritten in the form of Eq. (3.34) which 
is that of isothermal sphere. It was shown Sec. 3.2 that the solution has the 
asymptotic limit of € ~ (2/91?). Hence the exact solution has a mass profile 
with M(r) œ r at larger r; the model has to be cut off at some radius to 
provide a finite mass. 

For | < 2, the numerical solution is well approximated by the function 
ED = (14+ 12)-3, The projected, two dimensional, surface density corre- 
sponding to this €(1) is S(R) = 2(1+ R?)—!, where R is the projected radial 
distance. In other words, the (true) central surface density (0) ~ 2p-ro 
where pe is the central volume density and ro is the core radius. Since 
the mass density of the isothermal sphere varies as r~* at large distances, 
the potential ¢ varies as Inr. A disk of test stars embedded in such an 
isothermal sphere will have constant rotational velocities. A dark matter 
halo with such an isothermal distribution of particles will thus lead to flat 
rotation curves. Hence, isothermal sphere is a natural model for the dark 
matter halos. 

An important modification of the isothermal distribution function, 
which leads to a more realistic stellar distribution, is given by 


f(e) = EE (er = 1) ; €20 (8.42) 
and f(e) = 0 for e < 0. (This is called King model.) Since f(e) vanishes 
for e < 0, this model may be thought of as a truncated isothermal sphere. 
The density, when integrated from the origin numerically, will vanish at 
some radius rz (called the tidal radius). The quantity c = log|r;/ro] is a 
measure of how concentrated the system is. Bright elliptical galaxies are 
well described by such a model with [(0)/o?] = 10 and c ~ 2.4. 

The above examples show how galactic models of reasonable descrip- 
tion can be constructed as solutions to the collisionless Boltzmann equa- 
tion, Eq. (8.33). Unfortunately, this particular approach has no dynamical 
content and is merely a tool to fit observations. In fact, the situation is 
worse than that; for a wide class of acceptable p(r), it is always possible 
to obtain an f(e) which will self consistently reproduce the given density 
profile. To see this, we begin noting that a given p(r) uniquely determines 
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a y(r) and hence — on eliminating r between p(r) and y(r) — the function 
p(w). Writing Eq. (8.35) as 


1 Yp 
gO = HOVI Ed (8.43) 
V8r o 
and differentiating both sides with respect to 7, we get 
1 dp -, % f(ejde 


—— = ———. 8.44 
Vendy ~ Jy Vo=e ae 
This equation (called Abel’s integral equation) has the solution 
1 d ff 32) dy 
= —=—— > = | =, 8.45 
Naaa (3) Jes ii 


which determines f(€). Though this procedure gives an f(e), there is no 
assurance that it will be positive definite; obviously, the method works only 
for those p(r) for which f(e) > 0 for all e. 


8.2.3 Disk models 


Similar modeling also works for a disk-like system if the distribution func- 
tion is taken to be of the form f = f(e,Jz) where J, is the conserved 
z—component of the angular momentum. We will mention a few examples: 
If the distribution function is taken to be 


n 2 > 
fad) = AJ? exp(e/a*) (for Jz > 0) (8.46) 
0 (for J, < 0) 
then we obtain a surface density for the disk, which is given by: 
2 
(Rj soto (8.47) 





2nGR R 


The circular velocity of stars v? = —R(0~/OR) = 2rGLZoRo is a constant 
for this model, called Mestel’s disk. These parameters are related to the 
parameters in the distribution function by 


n= (v2/o?)—1; A= ZoRo [2"/? VaT(n + 1/2)0"*?| (8.48) 


The velocity dispersion in the radial direction g? is a free parameter char- 


acterizing the disk. The parameter n is a measure of the relative values of 
the steady circular velocity ve and the velocity dispersion ø and quantifies 
the “coldness” of the disk. 
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A more complicated set of disk models (called Kalnajs disk) can be 
obtained from the distribution function which has the form 

fle, Jz) = A [(02 — Rya? + (e +I) 7 (8.49) 


when the term in square bracket is positive and zero otherwise. This func- 
tion leads to the following surface density and potential: 





R2\1? r?GZo 1 
x = -= : = 2 = OR ; 
(R) = 3B (1 = ) ; an= Reso (8.30) 
with Yo = 2r Aay N2 — Q2. The parameter Q is free and describes the mean 
(systematic) rotational velocity of the system: < vg >= OR. 


8.3 Spectral energy distribution of galaxies 


Let us next consider the radiation emitted by a galaxy at different fre- 
quencies, usually called the spectral energy distribution (SED) of a galaxy. 
Since the galaxy is made of stars, gas and some amount of dust, it is — in 
principle — possible to express the total luminosity L(A) of the galaxy at 
wavelength À at any given time in the form 


L(A) = fev Cots exp(—T) + Lothers(A) (8.51) 


where p; is the density of stars of type i having a luminosity L;(à) and 
T is the optical depth for absorption of light at this wavelength within 
the galaxy; Lothers(A) is the luminosity due to non stellar sources. While 
stars contribute the dominant amount of energy in a galaxy, there are, 
of course, other sources which dominate the radiation in different wave 
bands. For example, the radiation in the x-ray band is possibly due to 
individual accretion disks in binary stars while the radio luminosity arises 
from synchrotron radiation from ambient electrons in the galaxy. 

Another major component of galaxies is dust — a rather disparaging 
term used by astronomers to describe silicates, carbides, metal oxides and 
several other components which usually blocks the light from the stars in 
the visual band. The dust lands up in the interstellar medium through 
the winds or mass loss from evolving low mass stars and also, of course, 
from supernova. As the gas moves further away from the stars, it cools and 
metallic compounds condense out of the gas. The equilibrium temperature 
of a dust grain at a distance r from a star of luminosity L can be easily 
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estimated by equating the flux of radiation emitted by the dust particle 
(treated as a blackbody at temperature Tyyst) to the radiation incident on 
the particle. This will give oT}. = (L/167r?)(Qin/Qout) where Qin, Qout 
denote the absorption and emission efficiencies of the dust grain and o = 
5.7 x 1078 W m~? K~‘ is the Stefan’s constant. If we take L = 100Lo, 
r = 700 AU, then we get Taust © 30 K (Qin/ Qout)!/ 4. The radiation from 
such a dust grain will peak around 100 microns. (In reality, Qout itself 
scales as Qout X aTdust where a is the radius of the dust grain and the 
actual temperatures of dust grains are somewhat lower.) 
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Fig. 8.1 Spectral energy distribution of a typical galaxy. Note that most of the light in 
the UV and optical band is contributed by the stars, while some amount of far-infrared 
radiation arises due to reprocessing of starlight by dust. The high energy radiation in 
x-ray band and radio emission are sub dominant. 


Figure 8.1 shows the schematic form of integrated spectral energy distri- 
bution of a model galaxy. The UV and optical light comes essentially from 
stars with different range of masses and corresponding temperatures. If the 
dominant contribution is from stars in the mass range (Mı < M < Mo) 
and if the temperature-mass relationship of stars is given by the function 
T(M), then the energy radiated per logarithmic band of frequency will be 


y M vtn(M) 
R i dM rph [ke T(M)] =1 a 
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where n(M) is the number density of stars in a mass range (M, M + dM). 
As the stars evolve, the parameters in this expression will change with 
time, making vI, change. However, two simple limits of this integral can 
be easily worked out. If the dominant contribution comes from a narrow 
range of masses, then the integral is dominated by a Planckian at a given 
temperature and the spectrum will approximately thermal. On the other 
hand, if a broad range of masses contribute so that the limits Mı and Mo 
are irrelevant, then the resulting intensity will be an approximate power 
law if T x M” and n(M) « M~°. Evaluating the integral, it is easy to see 
that 


vI, x v4 let) (8.53) 


in the Mı — 0, M22 — oo limit. These results, however, are applicable 
only over a limited range. At high frequencies, (hv > 13.6 eV, A < 912 
Å), absorption by hydrogen atoms will strongly suppress the intensity (see 
Fig. 8.1). At low frequencies, it is necessary to take into account re-radiation 
of energy by dust. A simple model for this re-processing can be based on 
the assumption that a fraction f of the optical-UV luminosity is absorbed 
and re emitted in the far infrared so that Lpr = fL. This FIR radiation 
will be in the form of a modified Planck spectrum with a spectral energy 
distribution Bye, where B, is the Planckian intensity for a temperature in 
the range of T ~ (45 — 80) K and e, is the dust emissivity which could be 
taken as a power law: €, x v? where x = 1 for v > 3 x 10!° Hz (100 um) 
and x = 2 for v < 2 x 10!! Hz (1 mm) with a linear extrapolation between 
the two limits. This will lead to the form shown in Fig. 8.1 in the FIR 
band. 

The total integrated light from a galaxy, in a given band, can be used 
to assign the magnitudes (and colour) in that band and one finds that 
the colour of a galaxy is broadly correlated with the galaxy type. By 
and large elliptical, lenticular and early spiral galaxies have absolute blue 
magnitudes —22 < Mg < —16, whereas later type spirals and irregulars 
have —18 < Mp < —12. 

This trend suggests, for example, that Sc galaxies will have greater 
fraction of massive main sequence stars relative to earlier spirals making Sc 
s bluer than Sa s and Sb s. For successively later type of galaxies, larger 
part of overall light is emitted in bluer wavelengths implying increasingly 
greater fraction of younger, more massive, main sequence stars. Irregulars 
are bluest of the Hubble sequence with (B — V) ~ 0.4. Many irregulars 
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also get bluer towards their centers (unlike early type spirals which become 
redder) suggesting that irregulars have active star forming regions in the 
center. These ideas about the composition are confirmed by the (M/L) 
ratio for spirals — which decreases progressively as one moves to later 
Hubble type — since upper main sequence stars have lower mass-to-light 
ratios. 


8.4 The evolution of galaxies 


Estimates of the age of the stars suggest that much of the star formation 
took place when the universe was about 20 per cent of its current age. If 
to ~ 14 Gyr, then most ellipticals and bulges of disk galaxies should have 
formed when the universe was about 3 Gyr old. To form a total stellar mass 
of about 10!! Mọ in a few Gyr, one requires star formation rates of about 
107M. yr~'. For any reasonable initial mass function, one would expect 
larger number of low mass stars to be formed compared to the high mass 
stars. But since high mass stars are significantly more luminous than the 
low mass stars, they dominate the luminosity in the initial stages of the star 
formation. After the first few million years of continuous star formation at 
a rate of, say, 10Mo yr7! a galaxy would have acquired a stellar mass of 
10’Mo. Of these, nearly 2 x 10° are massive stars contributing a total 
luminosity of about 2 x 10!°Lo. This activity increases the luminosity of 
the galaxy by about 20 percent and its colour becomes bluer because of 
the radiation by massive stars. Eventually (after about 10° yr), these high 
mass stars die out and the luminosity of the galaxy will be dominated by 
low mass stars. 

As the stars evolve over different time scales and move in the H-R di- 
agram, the physical properties of the galaxy will change with time. To 
study this process in detail, one begins with an initial mass function which 
characterizes the number of stars in a given mass range (which is produced 
at any given time) and some assumption regarding the variation of star 
formation rate in time. The initial mass function is usually taken to be a 
power law, (dN/dM) x M~®% with an index a ~ 2.35 (see Sec. 4.5.1). As 
regards the star formation rate, the two extreme limits which will bracket 
the possibilities are: (i) all the stars have formed in a single burst at some- 
time in the past or (ii) that the star formation rate is constant over the 
relevant period of time. Standard stellar evolution theory can then be used 
to evolve these stars forward in time and to determine the light emitted by 
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the collection of stars at any given instant. This procedure is usually called 
population synthesis. 

To illustrate the ideas which are involved, let us consider the case with 
index a = 2.35, metallicity Z = 0.008 and a steady star formation rate 
of 1Mo yr—!. Numerical modeling then shows that the luminosity of such 
galaxy can be parameterized both at t = 2 Myr, and at t = 900 Myr by 
the fitting function: 


rete 1 


At t = 2 Myr, the fitting parameters are given by A) = 3000A and k = 4 
which corresponds to a Planckian distribution with temperature T = 4.73 x 
104 K. At t = 900 Myr, the fit is with Ao = 1000A and k = 2.2 which is 
fairly distorted from a pure Planckian. 

This index k = 2.2 can be understood as follows: First, note that, for 
most of the range of À ~ (103 — 10*)A, the function (exp(A9/A) — 1)7? 
can be approximated as proportional to à making AL, x A~*+1. Next, 
recall (from Eq. (8.53)) that when a broad set of stars contribute to the 
luminosity, vI, « AL, œ AT4+(@-D/" giving 


pas ~(a-1). (8.55) 


Finally we saw in chapter 4 that for a wide class of stars, we have Tsu, « 
M”; Lax M”? giving Teur x M™e"#® x M” with n = nyRny % 0.3-0.6. 
If we use n = 0.5,a@ = 2.35, we find from Eq. (8.55) that k = 2.3, pretty 
close to the best fit value. Also note that this relation L, « A72? gives, in 
terms of frequencies, 


Ly x A?’ Ly œ v®? = constant. (8.56) 


These galaxies emit almost constant power per unit frequency in the range 
of 103 A < \ < 104 A; putting in all the numbers gives the result that the 
flux is about 102°° W Hz~? for a SFR of 1Moyr—!Mopc-?. 

In the case of a burst of star formation, we will expect the galaxy to 
redden as its stars age if the star formation does not continuously replenish 
the evolution and disappearance of old stars. Simple models for stellar 
evolution do lead to a conclusion in conformity with this. Figure 8.2 shows 
the numerical result for colour evolution for a Salpeter initial mass function 
and a burst of star formation. The colour-colour diagram for main sequence 
stars is superposed on the figure for comparison. 
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Fig. 8.2 The evolution of a typical galaxy depicted in the color-color diagram. The 
results for the galaxy, shown by the solid line, is obtained by numerical population 
synthesis assuming a Salpeter IMF and a single burst of star formation. The dashed 
curve corresponds to main sequence stars and is shown for comparison. 


This result regarding the luminosity evolution of galaxies can also be 
qualitatively understood from stellar evolution theory. Since the stars with 
M 2 1.25Mo, evolve in t < 5 Gyr, in any stellar population which origi- 
nated from a burst of star formation, there will be only stars with masses 
M $1.25Mo at t > 5 Gyr. Further, these low mass stars emit most of their 
energy during the giant branch phase. Since the time these stars spend in 
the giant branch (GB) is fairly short, we can estimate the luminosity of this 
population by 








IM T (8.57) 


La (rr) [Ae 

Mc 
where Egp(M) is the total energy emitted by the stars in the giant branch, 
(dN/dM) is the number of stars in the population with mass in the range 
(M,M+dM) and Mgp(t) is the age at which a star of mass M turns off 
the main sequence into the giant branch. Based on our understanding of 
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stellar evolution, we can take 


Mcx(t) ~( Ey aN M\* 
Mo Noa) ~ “am mm) (8.58) 





with a ~ 2.35. Combining these, we get 


_ KMoEcs(Mas) (Men \** * (8.59) 
~~ (25 Gyr) Mo 
or equivalently 
dla L a dln EcB dln Mcp Ès dln Ecg 
ding = iMen + (8.5 = a)| -at = 0.4a — Mae aa è 
(8.60) 


Since (dln Egp/dln Meas) is in the range (0, 1) and a & 2.35, we ex- 
pect L to be a decreasing function of t; more detailed models suggest 
(dln L/dlnt) = 0.3a — 1.6 which is in reasonable agreement with the above 
analysis. 


8.5 Luminosity function of galaxies 


The abundance of galaxies, (or, for that matter, any class of objects) can 
be characterized in two different ways. The first one provides a count of 
objects with a given apparent flux (or magnitude) per solid angle in the sky. 
Since the information is 2-dimensional and one is not measuring the actual 
distance to the object, this observation is comparatively easy to make and 
is essentially limited only by: (i) the limiting flux which can be obtained 
in a given wave band and by (ii) the angular resolution of the instrument. 
If n(F,) = (dN/dQdF,) denotes the number of sources with the observed 
flux densities between F, and F, + dF, per solid angle in the sky, we can 
immediately obtain the total intensity of radiation due to all the sources as 


[o 6] 
uel F, n(F,)dF,. (8.61) 
0 


If apparent magnitudes are used instead of fluxes, they can be related by 
F, x 10704 with the constant of proportionality depending on the wave- 
length band in which observations are carried out. In this case, the inte- 
grand in Eq. (8.61) is proportional to 10-°4"(dN/dQdm). Multiplying 
the intensity by (47/c) will give the energy density of photons at the local 
neighbourhood due to these sources. This has been done, for example, with 
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the source count of galaxies available from Hubble deep fields (HDFs). The 
HDF survey has detected about 3000 galaxies at U300, B450, Veos, Igi4 and 
1700 galaxies at J110, Higo bands. At still longer wavelengths, there are 
about 300 galaxies in K-band, 50 at (6.7 — 15)um and 9 at 3.2m. When 
all the galaxies up to the limiting magnitudes are taken into consideration, 
the total integrated contribution of light effectively converges and the inte- 
gration shows that vI, varies in the range (2.9 — 9.0) nW m~? sr—! in the 
wavelength band (3600 — 16, 000)A. 

A more detailed description of the abundance of the sources is provided 
by the luminosity function ®(L, z)dL, which gives the number of sources per 
unit comoving volume at redshift z in the luminosity interval (L, L + dL). 
Depending on the nature of the source, L could be either the luminosity in 
a given wave band or bolometric luminosity. This quantity can be obtained 
only from a survey which also has redshift information for the sources. 
Further, in order to be useful, there must exist sufficient number of objects 
at any given redshift bin. 

The luminosity function for galaxies has been investigated extensively 
for nearby galaxies (z ~ 0). We shall concentrate on these results and 
will briefly mention the status of investigations at higher redshifts. While 
considering galaxies at z = 0, the luminosity function is a function of a 
single variable, ®(L)dL, which gives the number of galaxies per unit volume 
in the luminosity band L, £L+dL. Observations suggest that this luminosity 
function is reasonably well described by 


eee (oon 


which translates to 


(M) = (0.4 In 10) &* 1094+D"—™) exp (—10°4°-34)) (8.63) 


in terms of magnitudes. The parameter ®* = (1.6 + 0.3) x 10~2h? Mpc73 
sets the overall normalization; a = —(1.07 + 0.07) determines the slope of 
the luminosity function at the faint end; L% = (1.24 0.1) x 101°? Lo 
(which corresponds to the blue magnitude M$ = (—19.7 + 0.1) + 5logh) 
is the characteristic luminosity above which the number of galaxies drops 
exponentially in the blue band. 

This function, called Schecter luminosity function, is plotted in Fig. 8.3 
by the unbroken line. Since a ~ —1, the integral of ®(Z) over L can 
diverge (or become very large) at low L suggesting a copious number of 
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low luminosity galaxies. The total luminosity density contributed by all 
galaxies, however, is finite: 


kot = f LO(L)dL = PLT (2 + a). (8.64) 
0 


For a = —1, hot = OoL*. There is fair amount of scatter in the observed 
values of 6*, M* and a in published literature. Different observations have 
given ©* varying in the range [1.4 — 2.6] x 10-2h? Mpc73, M*(B) varying 
in the range (—18.8 to —19.68) and a varying in the range (0.97 — 1.22). 
For example, Sloan Digital Sky Survey (SDSS) gives in R-band, the values 
a = —1.16 + 0.03 and ¢. = (1.50 + 0.13) x 1072h? Mpc™?. The ltot, 
however, does not vary all that much and most observations give around 
ltot = 2 x 10êhLo Mpc73. 

There has been some attempts to study the luminosity function of galax- 
ies at high redshifts in order to ascertain whether there is systematic evo- 
lution. At z œ% 3, the luminosity function of galaxies can be fitted with 
a = —1.60+0.13, m, = 24.48+0.15 and ®, = 1.6x10~?; at z = 4, the corre- 
sponding parameters are about a ~ —1.60, m, œ% 24.97 and ®, = 1.3x 107? 
in the I-band. It is difficult to make useful comments about galaxy evolution 
based on these results at present. 

Given the luminosity function for galaxies, one can estimate several 
useful quantities in a relatively straightforward manner. To begin with, 
ignoring the effects of spacetime curvature, one can relate the redshift z and 
observed luminosity f of a galaxy to its distance r and intrinsic luminosity 
L by z œ v/c ~ (Hor/c) and f = (L/4rr?). Then the joint distribution 
function in z and f for galaxies is 


dN 1 [®_ _ , L Hor L 
dQdzdf — = ii (=) ôD (- c ) ôD (s =a) 
(8.65) 


The factor (1/47) in front converts the number density to density per solid 
angle and the Dirac delta functions select out the required redshift and flux. 
Using the known form of ® and evaluating the integral, we get 





dN An ( c (8.66) 


5 
pee e fie 4p 27,2 
dndzdf L, $) EP Ze) aA 


The redshift distribution at a fixed apparent luminosity f varies as 
z4®(z7/z2) which shows the effect of diminishing volume at low z (the 
z* factor). The quantity ze is essentially the mean redshift of galaxies with 
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Fig. 8.3 The Schecter luminosity function for galaxies. The solid line gives (L) and its 
numerical value is indicated on the left hand side vertical axis. The dashed line gives the 
number of galaxies per logarithmic band in luminosity and the corresponding numbers 
are given in the vertical axis on the right hand side. 


a flux f. The mean luminosity per unit volume is given by 


1 pe L\ dL 3 1; 
j= i Lọ EJL T(2+ a)®,L, = (0.77 — 1.3) x 10°hL@Mpc”. 
0 * * 
(8.67) 
From this, one can estimate the typical number density of galaxies 
n, = z =T(2 +a), = (6.7 — 14.9) x 10-3A?Mpe~3 (8.68) 
and the mean separation between the galaxies 
da = n7? = (4.13 — 5.35)h~ Mpc. (8.69) 


Multiplying the observed mass-to-light ratio of galaxies R = (9.8 — 
14.65)h(Mo/Lo) by the luminosity density j in Eq. (8.67) we get the mass 
density of galaxies to be 


p = JR = (0.88— 1.62) x 10°h? Mo Mpc? = (5.82—10.8) x 10722h? g cm™?. 
(8.70) 
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The luminosity function also allows us to estimate the gravitational 
binding energy locked up in the galaxies. If the galaxies are assumed to be 
embedded in the dark matter halos which are isothermal spheres of mass 
M and line-of-sight velocity dispersion o, then their binding energies will 
be U = 2Mo?. Of course M here is the total mass and is related to the 
galaxy luminosity L by some (M/L) factor. The quantity usually measured 
in solar units (Mo/Lo) is about 10 for galaxies. As we saw earlier, (see 
Eq. (8.13)), the velocity dispersion can be related to the luminosity using 
the relation: L/L* © (a/o*)*. Assuming a constant (M/L) ratio allows 
us to convert the mass to the luminosity and express U in terms of L. 
Integrating over the luminosity function then gives the binding energy per 
unit volume in galaxies to be 


U = 26*L*a*? < M/L >T(5/2+a) (8.71) 


* 


o* is about 240 km s~! for ellipticals and 170 km s~! for spirals. Taking 
an average value, the binding energy of the galaxy is given by 


U ~ 1300h? (Se) Moc? Mpc”?. (8.72) 


(Note that Moc? Mpc~? = 6.1 x 10” erg cm~3.) 

Finally, you should appreciate a key observational aspect in any astro- 
nomical survey. The faint end of any luminosity function is intrinsically 
hard to determine. For example, consider a population of sources for which 
the number of sources in the luminosity bin (L, L + dL) is a power law: 
N(L)dL x L~°dL or, equivalently, 


dN 
din L 
The maximum volume V (L) in which a source of luminosity L could lie and 
still be detected in a flux limited survey varies as V œ L?/2. It follows that 
the number of sources observed per logarithmic interval in the luminosity 
will vary as: 





x Lie, (8.73) 


n(L) « Va oc L5/2-@, (8.74) 


Suppose you want survey to have dynamic range y (so that (Lmin/Dmax) = 
107%) with about No = 10* objects in the faintest bin, Then, it is nec- 
essary to have N > 10710¥(5/2)-) objects in the survey. The ran- 
dom Poisson fluctuations in the faintest bin will give a fractional error 
of No 1/2 _ 1072/2, To get 20 per cent accuracy in the lowest (log L) bin, 
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say, we need 0.2 ~ No 1/2 ~ 1072/2 giving z ~ 1.5. Then the constraint 
on N translates to about 2500 sources for a = 1.5, y = 2 while one needs 
about 2.5 x 10° sources for a = 1.5,y = 4. If œ = 1 then one requires 104 
sources for y = 2 and 10® sources for y = 4. These numbers show that 
accurate determination of luminosity function of any set of faint object, in 
order to make a census of the universe, will require fairly elaborate surveys. 


8.6 Distribution of galaxies 


Clustering of galaxies into groups is typical in the distribution of the galax- 
ies in the universe. A survey within a size of about 20 Mpc from our galaxy 
shows that only (10 - 20) percent of the galaxies do not belong to any group; 
they are called ‘field’ galaxies. 

To study the clustering of galaxies quantitatively one needs a good sur- 
vey of the universe giving the coordinates of galaxies in the sky. Of the 
3-coordinates needed to specify the position of the galaxy, the two angular 
coordinates are easy to obtain. There exist today several galaxy catalogs, 
containing the angular positions of galaxies in particular regions of the sky, 
complete up to a chosen depth. If we know the redshift z of these galaxies 
as well, then we can attribute to it a line-of-sight velocity v & zc. If we 
further assume that this velocity is due to cosmic expansion, then we can 
assign to the galaxy a radial distance of r & Hj /v. This will provide us 
with the galaxy position (r,6,¢) in the sky. 

Currently available data points to an interesting pattern in the distri- 
bution of galaxies in space. The single most useful function characterizing 
the galaxy distribution is what is called the two-point-correlation function: 
€aq(r). This function is defined via the relation 


dP = 7 (1 + éca (rı T r2))d°r\d°r2, (8.75) 


where dP is the probability to find two galaxies simultaneously in the re- 
gions (r1,T1 + d°r;) and (r2,r2 + d’r2) and 7 is the mean number den- 
sity of galaxies in space. The homogeneity of the background universe 
guarantees that Eaa (r1, r2) = cc (rı — r2) and isotropy will further make 
Eaa(r) = €4aa(|r]). From Eq. (8.75) it follows that &acq(r) measures the 
excess probability (over random) of finding a pair of galaxies separated by 
a distance r; so if éca (r) > 0, we may interpret it as a clustering of galaxies 
over and above the random Poisson distribution. Considerable amount of 
effort was spent in the past decades in determining €gg(r) from observa- 
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tions. These studies show that 


SN 
éaa(r) > (=) ; ro = (5 + 0.5)h7} Mpc; y = (1.77 + 0.04) 


To 
(8.76) 
in the range 10h~'kpe < r < 10h7!Mpc. 

Incidentally, it is also possible, using the catalogs of rich clusters (like 
the Abel catalog which contains 4076 clusters), to compute the correlation 
function between galaxy clusters. The result turns out to be correlation 
function 


-1.8 
Eco(r) © (same) (8.77) 


Comparing with the galaxy-galaxy correlation function gq quoted before, 
we see that clusters are more strongly correlated than the individual galax- 
ies. If firmly established, this result has important implications for the 
theories of structure formation. Unfortunately, &cc(r) is not as well estab- 
lished as gg and hence one has to be cautious in interpreting results which 
depend on écc. 

The distribution of matter is reasonably uniform when observed at scales 
bigger than about 100h7! Mpc or so. For comparison, note that the size 
of the observed universe is about 3000h~!Mpc. Thus, one may treat the 
matter distribution in the universe to be homogeneous while dealing with 
the phenomena at scales larger than 100 Mpc or so. 


Exercises 


Exercise 8.1 Dynamical friction near galactic center: 


Assume that the mass density of stars near the center of the galaxy varies as 
p(r) x r~7/4 with the total mass contributed by the stars within a radius of 
about 1.8 pc being about 3 x 10°Mg. Near the center, there is a black hole of 
mass Mgy which dominates the dynamics and the stars move in circular Keplerian 
orbits under the influence of this black hole in the central region. (a) Show that 
the time scale for dynamical friction of stars scales as r'/*. (b) Calculate this 
time scale for a 10Mọ star at r = Ipc. [Ans. 4 x 10° yr.] 


Exercise 8.2 Black hole in M 87: 


The giant elliptical galaxy M 87 in the Virgo cluster has a black hole of mass 
3 x 10°Mo at its center surrounded by a diffuse hot intracluster medium with a 
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density of 0.5 cm7? and a sound speed of 500 km s71. “Show that an order of 
magnitude estimate for the accretion rate of this medium on to the black hole is 
about 0.1 Mo yr™!. If the efficiency of conversion of energy to radiation is y = 
0.1, what is the resulting luminosity as a fraction of the Eddington luminosity? 
[Actual observations indicate a luminosity of 10*? erg s7! suggesting that it is a 
low efficiency accretion flow.] 


Exercise 8.3 Rotation curve: 


The rotational velocity of the Sun in the Milky Way galaxy is ve(R) = 220 km 
s7! at R = 8 kpc. What does this imply for the mass contained in the region 
r < R? Radio observations of the 21 cm emission from neutral hydrogen in the 
direction 30 degrees from the galactic center (on the same side as the direction 
towards which the Sun is orbiting) show a maximum radial velocity of 90 km s~?. 
This could be interpreted as coming from a cloud of neutral hydrogen moving in 
a circular orbit of radius r in the galaxy and located such that the radial velocity 
is maximum. The observed radial velocity, of course, is the difference between 
the true circular velocity ve(r) and the component of velocity of Sun in the same 
direction. Use this to estimate the true uc(r) and the mass contained within this 
radius. (This is the technique used to obtain the actual rotation curve of Milky 
Way galaxy.) 


Exercise 8.4 Practice with surface brightness: 


Consider a typical Sbc galaxy, NGC 5055, located at a distance of D = 8 Mpc 
and seen at an inclination i = 60°. (This is the angle between the normal to the 
plane of the galaxy and the line-of-sight.) R-band photometry along the major 
axis of this galaxy shows that the surface magnitude can be fitted reasonably well 
by the relation 
0 
HR = 19.75 + 0.72 (x) ‘ (8.78) 
1 arcmin 

Determine the surface luminosity profile in physical units. [Answer: Dr(R) = 
190e~ ("7/35 FP) L Ro pe7?. | 


Exercise 8.5 Integrated magnitude: 


A spiral galaxy has a surface brightness profile I(R) = Ine~®/** where Ip = 21 
mag arcsec"? and Rs = 7 kpc. Calculate the total magnitude of the galaxy if it 
is at a distance d. [Ans. Since magnitudes are logarithmic, one cannot simply 
add them but one can convert them into fluxes and then add. This should give 
I= Tot = —11.79 + 5log(d/1 kpc).] 


Exercise 8.6 Practice with sky background: 


(a) Show that a sky brightness of 21 mag arcsec~? is the same as 1.6 x 10° 


photons s~' cm~? A~? ster~! = 3.78 x 107° photons s~! cem”? A~! arcsec7?. 
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(b) Compute how many photons per second will be collected from 1 arcsec” of the 
sky by a 1 m telescope with a V-filter of width 1300 A. (c) Take the B-band Tully- 
Fisher relation for spiral galaxies to be Mg = —9.95 log(umax/1 km s~*) + 3.15. 
Further, let R25 be the radius at which the surface brightness of the spiral galaxy 
falls to 25 mag arcsec~? in B-band. It is also known that Ros in kpc obeys the 
empirical relation log Res = —0.249MpB — 4. Given that a particular galaxy has 
Umax = 320 km s~! and has an apparent B = 12.22 mag determine the absolute 
B magnitude, the distance to the galaxy and the radius R25. If one wants to 
measure the galaxy profile down to R25, how much time do you need in a 4m 
class telescope? 


Exercise 8.7 Counting the stars: 


The surface brightness of a galaxy at a distance d is Io mag arcsec™*. Assume 
that the galaxy is made up of stars of absolute magnitude M. Show that the 
number of stars contained within 1 arc second square patch of the image of this 
galaxy is N = (d/10 pc)?10°4%—/0), 


Exercise 8.8 From stars to the galaxies: 


One can do a better job on the spectral energy distribution of a galaxy by using 
our knowledge of stellar luminosities. The results discussed in chapter 4 allow 
you to relate the surface temperature to the mass of the star. Assuming that each 
star emits as a blackbody at this temperature and convolving it with an initial 
mass function, one can obtain the total radiation at any frequency v due to the 
stellar population. One can also compute U-B and B-V indices for the system. 
This is, of course, valid for a young galaxy in which none of the stars have evolved 
off the main sequence. Do this and compare your results with those quoted in 
Sec. 8.4. 


Exercise 8.9 Poor man’s population synthesis: 


The purpose of this exercise is to use a simple model for the evolution of galaxy 
luminosity to capture the essential physics. To do this, let us assume that there 
are only 3 types of stars O, A and M with masses (40, 4, 0.5) Mo, V-band 
luminosities (2.5 x 10°, 80, 0.06)Lov, B-V colours (—0.35, 0.0, 1.45) and the main 
sequence lifetime (1.6 x 10°,5 x 10°, 7.9 x 10°°) yrs. The relative number of each 
type of star is given by the Salpeter IMF at t = 0. (a) When all the stars are still 
in the main sequence, what is the M/L ratio and the B-V colour of the galaxy? 
(b) What will be the corresponding values at 10” yrs and at 10° yrs? Note that in 
evaluating M/L at later times you should take into account the mass contributed 
by the remnants. 


Exercise 8.10 How many galazies can you see? 


Suppose all the light from a galaxy is from stars with L = 0.2L and that there 
are about 101 stars in the galaxy. Show that the absolute magnitude of such a 
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galaxy is about —21. What is the maximum distance at which such a galaxy will 
be visible to naked eye? Does it make sense? Is the fact that the starlight from 
a galaxy is spread over a patch in the sky relevant? 


Exercise 8.11 Vertical oscillation of stars: 


Assume that our galaxy is a circular disk with radius R and vertical height H 
with the stars distributed uniformly in the vertical direction (which we take as 
z—axis) and as an exponential disk along R. Consider a star which “drops” from 
a high zo and oscillates in the vertical direction normal to the plane of the galaxy. 
Estimate the frequency of oscillations in the vertical direction if the local density 
of stars near the Sun is about 0.2 Mo pe~?. 


Exercise 8.12 MACHOS: 


If our galaxy has a flat rotation curve with vo = 220 km s~", estimate the local 
density of dark matter at R = 8.5 kpc. (Assume that dark matter is distributed 
as a spherical isothermal sphere.) Suppose the dark matter is made of MACHOS 
(MAssive Compact Halo Objects) each of which has a mass of 0.5 Mo. (This is 
not true!) Estimate the number density of such objects near solar neighbourhood. 
How does it compare with the number density of stars? 


1 


Exercise 8.13 The galactic potential: 


It is often convenient to have an analytic model describing the gravitational 
potential of Milky Way. One such model consists of 3 components: disk, spheroid 
and a halo. The disk and spheroid are assumed to have the potential of the form 


= -GM 

[R2 + (a + Jz? + b2)2]3/2 
with different parameters from the disk and spheroid. For the disk, one takes 
aa = 4.20 kpc, ba = 0.198 kpc, and Ma = 8.78 x 10’°Mo while for the spheroid, 


it is as = 0,bs = 0.277 kpc, and Ms = 1.12 x 10!°Me. The halo is described by 
a spherically symmetric potential 


te= CO 5 In (2 + 5) + = tan`! (=)| (8.80) 


Th Th 


(R, z) (8.79) 





where r is the spherical polar coordinate radius and ra = 6 kpc, Mn = 5x101" Mọ. 
(a) Compute the speed of the circular orbit in the galactic plane. What is the 
value near the Sun? Do you get flat rotation curves at large radius? (b) Compute 
the density distribution wkich will give rise to these potentials. What is the mass 
density in the solar neighbourhood? (c) Consider a star which was moving in a 
circular orbit at some radius ro. Suppose it was perturbed slightly either in the 
radial or in the vertical directions. Estimate the radial and vertical oscillation 
periods. 
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Exercise 8.14 Sinking to the center: 


(a) Consider a galaxy of mass 3 x 10’!Mo moving in a cluster containing a 
large isothermal dark matter halo of density p ~ 10~7° gm cm~3. Compute the 
time scale for dynamical friction using Eq. (1.66). Also show that the drag force 
acting on the galaxy makes the angular momentum and the kinetic energy of the 
galaxy to decay. Work out the time dependence of the orbit if the halo has an 
isothermal density profile. (b) A satellite galaxy of mass M is orbiting our galaxy 
which has an isothermal halo of velocity dispersion 220 km s7! and a total mass 
of 3 x 10'?Mo. Integrate the equations numerically and determine the orbit of 
the satellite galaxy with and without the dynamical friction. Do you think the 
orbital decay will have observable effects? 


Exercise 8.15 Real life astronomy: Galary surveys: 


For this problem, one can choose from one of the many galaxy surveys. Let us 
concentrate FLASH survey [Kaldare et al., MNRAS, 339, p. 652 (2003)]. (a) 
From the survey pictures which goes up to 2x 10* km s7}, determine the physical 
distance dmax probed by this survey. The survey is sensitive to galaxies with blue 
apparent B magnitude brighter than 16.7. What will be the absolute B magnitude 
of the galaxy at the limit of the survey and the corresponding luminosity? (b) The 
Milky Way has a blue luminosity of 2 x 10’°Lgq. What is the largest redshift at 
which FLASH would have picked up a galaxy with the luminosity of Milky Way? 
Comment on the implication. (c) The structures that you see in galaxy survey 
pictures — which are elongated in the radial direction (dramatically called fingers 
of God) — arise due to velocity dispersion within a cluster. Estimate the velocity 
dispersion for one of the structures in the picture taking the data from it. (d) 
If you look at the corresponding figures of a deeper survey (like Las Campanas; 
look that up) you see less of ‘fingers of God’. Why? 


Exercise 8.16 Real life astronomy: MACHO lensing: 


Take a look at Alcock et al., Nature, 365, p. 621 (1993) which has the light curves 
of the first LMC microlensing event detected by macho experiments. (a) Use these 
light curves to determine the characteristic time scale t of amplification which may 
be defined as the time during which A(t) > 1.34. (b) Calculate the total mass 
of Milky Way galaxy up to Rimc = 50 kpc assuming a flat rotation curve at 
Ue = 220 km s7}. (c) Using the above results estimate the radius of the Einstein 
ring re. [Ans: About 1 AU.] (d) Assuming that the lens is half way between us 
and the LMC, estimate the lens mass. [Ans: 0.05 Mo.] Analysis of subsequent 
events show that a typical mass of a lens is closer to 0.5 Mo. In principle, this 
could be a 0.5 Mo star or a 0.5 Mo white dwarf. Which do you think is more 
likely? (e) Several years of MACHO experiment allows one to determine the 
probability for micro lensing. One finds that it is about p = 1.2 x 1077 per star 
that is monitored. This allows an estimate of the mass density contributed by the 
MACHO and thus the total mass of MACHOs up to LMC. Compare this with 
the result obtained in (b) above. 
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Exercise 8.17 Real life astronomy: black hole at Milky Way center: 


Take a look at Schodel etal., Nature, 419, p. 694 (2002) which has a figure giving 
the orbit of a star (S2) a point near the center of our galaxy (around Sgr A). (a) 
Assuming that the orbit is an ellipse and that the plane of the orbit is inclined 
to the plane of the sky, describe how you would calculate the inclination angle 
from the given data. (b) Assuming that the physical size of the semi major axis 
is a = 4.62 x 107? pc and the angular size is 0.119 arcsec, determine the distance 
to the galaxy center used in this paper. This distance is known to an accuracy 
of about 0.5 kpc. What is the error induced on the semi major axis due to this 
and how does it compare with the measurement uncertainties in the paper? (c) 
Determine the mass M of Sgr-A and the uncertainty in its estimate. 


Notes and References 


The following textbooks deal with the subject of galaxies in much greater detail: 
Binney and Tremaine (1987), Elmegreen (1998), Carroll and Ostlie (1996), 
Binney and Merrifield (1998) and Padmanabhan (2002). Figure 8.2 was adapted 
from chapterl of Padmanabhan (2002). 


Chapter 9 


Active Galaxies 


So far we were describing galaxies which are ‘normal’ in the sense that their 
energy emission is mostly due to starlight. Another class of extragalactic 
sources which are of considerable interest but are not powered by starlight 
is called Active Galactic Nuclei (AGN, for short). These objects display a 
variety of exotic features many of which are still not fully understood. 

The AGN have spectacular properties. They produce very high lumi- 
nosities (even 104 times the luminosity of a typical galaxy) from within a 
tiny volume (less than about 1 pc*). Their spectra extend over a broad 
range of frequencies and the luminosity per decade in frequency is roughly 
constant across several orders of magnitude in the frequency. Their line 
spectrum (in optical and UV) is quite strong and the total flux in lines 
could be few to tens of percent of the continuum flux. Some of the lines 
have a width which indicates bulk velocities ranging up to 104 km s7}. 
AGN also show remarkable variety in their radio morphology and strong 
time variability in different wave bands. They will be the subject of this 
chapter. 


9.1 AGN: Basic paradigm and the spectra 


One of the simplest characterization of AGN is that they are very compact 
objects emitting anywhere between (104? — 104%) erg s7! of power. Since a 
typical galaxy has a luminosity of 104 erg s~!, the AGN have power output 
which ranges from 1 percent of a typical galaxy to 104 times as large. A 
power of 1047 erg s7! 
at the rate of about 2 solar mass per year. Alternatively, this is equivalent 
to the energy output of about 10” O type stars confined to a volume of 
about 107° pc?. You can see that AGN are unusually bright objects! 


is equivalent to the conversion of the rest mass energy 
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For nearby AGN, which can be actually seen as embedded in a diffuse 
host galaxy, the optical image of the nuclear region appears as a bright point 
source whose flux can exceed the flux from the rest of the host galaxy. 
This suggests that in most of the AGN, which are too far away for the 
host galaxy to be detected, we can attribute the emission as arising from 
a compact nuclear region of the host galaxy, thereby justifying the name 
active galactic nuclei. 

The optical luminosity of most AGN vary at a level of about 10 per cent 
in a time scale of few years, which is quite unlike normal galaxies. The 
relative change in the amplitude seems to increase at higher frequencies 
with nearly a factor 2 variation in x-rays. Unlike variable stars, AGN do 
not show any periodicities or special time scales. 

The continuum radiation from the AGN is quite different from the spec- 
tra of normal galaxies. Roughly speaking, the luminosity of a typical galaxy 
arises from the luminosities of the individual stars which radiate as black- 
bodies. The typical stellar surface temperature varies only by about a factor 
10 among the stars that dominate the galactic power output. Since the bulk 
of the radiation in a blackbody is confined to a range which varies only by 
about factor 3 in frequency, a typical galaxy emits all its power within no 
more than one decade in frequency; in fact, it is usually less than a decade. 
The continuum spectra of AGN, in contrast, is relatively flat in terms of 
vF,, all the way from mid-infrared to hard x-rays, spanning a range of 10° 
in frequencies. That is, the broadband spectrum of an AGN continuum 
can be characterized by a power law: F, = Cv~* where a <1. Such a 
power-law spectrum has relatively far more energy at high frequencies com- 
pared to blackbody spectrum. Compared to normal galaxies, the fraction 
of energy radiated in radio is about an order of magnitude larger and the 
fraction that is emitted in x-rays could be even 3 to 4 orders of magnitude 
larger. There are minor features in the spectra, but no one frequency band 
dominates the energy output of AGN. 

To make a quantitative estimate, let us calculate the total amount of 
energy density due to AGN using the observed number of AGN (per stera- 
dian in the sky) having a flux between F, and F, + dF. The photons we 
receive from distant cosmic sources sets a (useful) lower bound to the total 
amount of electromagnetic energy which was radiated in our direction. If 
the universe is truly homogeneous and isotropic, then the number of pho- 
tons here, today, is a good indicator of their mean density in the universe. 
It follows that the local energy density of AGN photons at frequency v is 
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just 


An dN 
U, = = far, (5) F,. (9.1) 


This quantity can be calculated in any wavelength band (optical, X-ray, 
....jin which survey results for AGN searches in the sky are available so 
that we know (dN/dF,). 

Observations show that the number N(< mg) of quasars (which are a 
prominent subclass of AGN; in fact we will use the quasars as a prototype 
of AGN throughout the chapter) per square degree in the sky brighter 
than mg in the B band can be expressed in the form log N = amg + 8 
with a = 0.88, 8 = —16.1 for mg < 18.75 and a = 0.31 and 8 = —5 for 
mp > 18.75. Computing dN/dmg, converting F, to mg and evaluating 
the integral in Eq. (9.1) in the range, say mg = (12,24) we get the total 
B-band flux (over 47 steradian) to be about J, = 40 Jy. Multiplying 
by the B-band frequency and dividing by c gives the energy density per 
logarithmic band vU, ~ 0.8 x 10717 ergs cm~$, To find the energy density 
in all the bands we need to repeat this exercise in each band. However, 
since good quality data is not available in other bands, we can attempt to 
estimate this by applying a bolometric correction from, say, the B-band 
luminosity to the bolometric (i.e. total) luminosity. Because of redshift, 
the observed frequency v will correspond to the frequency v(1 + z) at the 
time of emission and the bolometric correction will, therefore, depend on 
the redshift. Taking these into account by a factor of about 30, one finds 
that the local energy density due to AGN radiation is about 


B 
Erad œ~ 2 x 10718 (Žž) erg cm™? (9.2) 


where B is the bolometric and redshift correction. For comparison, note 
that energy density of starlight is about 104 times larger, CMBR. energy 
density is about 10° times larger and the integrated light from normal 
galaxies (with Mg = 20) is about 10 times larger. 

If the AGN radiation was emitted through a process of efficiency 7, then 
the cosmological mass density used up by all the quasars per unit volume 
is given by 

Pace = ee =2 x 10397! Mg Gpe™?. (9.3) 
This is the mass density which, when converted through a process having 
efficiency 7, will lead to the observed radiation from the AGN. If the process 
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goes on in a wide class of field galaxies with a luminosity L ~ L* (see 
Sec. 8.5), then we can obtain the amount of mass which each of the galaxies 
should provide for this process, by dividing this expression by the space 
density of galaxies with luminosities within, say, one magnitude of L*. 
The latter is given by 2 x 10°h3 7, Gpc~* (see Sec. 8.5) so that the mass 
contributed per galaxy is about 


B -1 
Mactive ~ 1.6 x 10’Mo (5) (+) ee (9.4) 


The scaling of this expression with respect to 7 is noteworthy. The efficiency 
of chemical reactions corresponds to about 1 eV per atom so that the energy 
efficiency per unit rest mass is about Nchem = 107 !° if the mass of the atom 
is about 10m,. In the stellar nuclear reactions, the energy released in 
hydrogen burning is about (8 MeV/m,) corresponding to nuci œ 8 x 1073. 
If such processes are to be used for powering AGN radiation, then the active 
mass which is involved will be 10!°Mo in chemical reactions and about 
10°Mo for nuclear reactions. In contrast, accretion on to a gravitational 
potential well from the lowest stable circular orbit around a black hole can 
lead to efficiencies of the order of Nacer * 0.1 which is the scaling used in 
the above expression. When a body of mass m falls from r = co tor © 5Rs 
where Rs ~ (2GMpy/c”) is the Schwarzschild radius of a compact object 
of mass Mpy, it radiates the potential energy 


=! GMpum 7 GMpum 


U fe ie ca 
5Rs 10GMpu/c? 


= 0.1 me’. (9.5) 


This gives an efficiency 7 of 10 percent which is an order of magnitude more 
than the efficiency achieved in hydrogen fusion. (Only direct particle-anti- 
particle annihilation can give rise to higher efficiencies.) In other words, 
among all processes which assemble certain amount of fuel for producing 
radiation from a compact region, accretion will require the least amount of 
fuel for a given amount of radiation. Since assembling the fuel in a compact 
region and converting it into radiation is a difficult process, the problems 
will be aggravated in any model which uses a process with lower value of 7. 
This leads to the standard paradigm of AGN being powered by an accretion 
disk around a massive black hole. 

There is another observation which is also suggestive of this paradigm. 
If the radiation, emitted from a region of size R which is moving with speed 
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v, varies at a time scale At, then we must have 


2 
R< A eani- 5. (9.6) 


This constraint arises from the fact that in the rest frame of the emitting 
region, causal processes can act coherently only over a size R < cAr since 
the information about the variation in light intensity can only travel a 
distance cAr in a proper time interval Ar. Using Ar = At/y leads to 
the inequality given above. For At = 1 hour (which is the time scale of 
variation observed in AGN) and y ~ 1, we get R < 10!4 cm which is even 
smaller than the size of the solar system! Since the total luminosity of 
the AGN is larger than that of an entire galaxy, we need a very efficient 
and compact mechanism for power generation. Of all the known radiation 
mechanisms, accretion on to a massive object fits the bill since it can easily 
convert rest mass energy into radiation with an efficiency of a few percent 
from a very compact region. 

Let us now examine this process in greater detail. With an efficiency of 
n ~ 0.1, one can achieve a luminosity L = nMc? if the accretion rate is M. 
That is, 


tpi ee -1 ({_L V 


which is not difficult to achieve in the dense galactic centers. We, however, 
know that accretion luminosity is bounded by the Eddington luminosity, 
Lpg (see chapter 2) which is related to the mass Mpy of the compact object 
by 


Lpwi5x 1038 Mou ergs}. (9.8) 

Mo 
To achieve a luminosity L ~ 5 x 106 erg s~!, we need Mpy > 3.3 x 10°Mo. 
Note that the mass corresponding to a black hole with the Schwarzschild 
radius R ~ 10! cm (which is the size of the central engine determined 
from time variability) is indeed 3.3 x 10°Mo. This suggests that one might 
be able to get a consistent picture involving a massive black hole fueled by 
accretion. 

The real difficulty in accretion models is not with energetics but with: (i) 
time scales and (ii) in providing a mechanism which will make matter shed 
its angular momentum. To see the issue of time scales, note that the age of 
the universe at the redshift of z ~ 5 is only about (5—10) x 108h9.75 yr. This 
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is not very long compared to the galactic free fall time (about 10°(r/10 kpc) 
yr) or the black hole growth time defined by 73H = (M/Mgu)~}. Using 
M x L/ņ and Mpy « Lp, we find that 7gq © 4 x 107m. (Le/L) yr. Ifa 
black hole has to grow from a original mass of about 10M to a final mass 
of 108Mo, it would take about 16 e-folding times; that is, the time required 
is more like 167gH œ% 6 x 1087.1(Le/L) yr. The universe just may not 
have sufficient time at high redshifts to allow this. Low efficiency accretion 
permits more rapid growth of black hole (probably in its early stages) but 
this will require far more total mass to be accreted in order to provide the 
same amount of energy. 

The situation is made worse by the fact that the angular momentum 
of the matter needs to be removed efficiently if accretion has to work. For 
matter in Keplerian orbits in a galaxy, the angular momentum for unit 
mass is J/m = (GMr)!/? where M is the mass inside a region of radius 
r. If we take M ~ 10!1Mo and r = 10 kpc for estimating the initial 
angular momentum J; and take M = Mpy = 10’Mo and r = 0.01 pe for 
estimating the final angular momentum Jp, then Jy + 10-°J; showing that 
almost all the angular momentum needs to be removed by some viscous 
drag if accretion has to be feasible. 

Assuming this can somehow be achieved, an accretion disk around a 
supermassive (~ 108M) black hole will provide a nearly thermal radiation 
in the optical and UV. At first sight this might sound surprising since stel- 
lar mass compact objects in HMXB/LMXBs emit in X-rays (see Sec. 5.5) 
and you might expect supermassive black holes to lead to higher energy ra- 
diation. In fact, the increase in the Schwarzschild radius of the black hole 
works in the opposite direction: If we take the luminosity emitted from 
regions near the black hole to be proportional to L « (GM: BHM )/Rs where 
R, = (2GMpu/c?), then L « M and is independent of Mpy. If this energy 
is emitted from an area A œ R?, the flux will be F « (L/A) x (M/M3y). 
Assuming the radiation is thermal with temperature T, we can write 
F œ T* thereby leading to the estimate of the disk temperature 


ae 1/4 

c 

Taisk = | — r . 
disk (z) (9.9) 


where all the proportionality constants have been re-introduced. If the total 
emission is a fraction f of the Eddington luminosity and the efficiency of 
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the process is 7, we also have the relation 


4nGe 
k 


Mc? =f 





Mpu (9.10) 


(where « = 0.2(1 + X) cm? gm”! is the electron scattering opacity with 
X = 0.7), leading to 


M = + Mpun. (9.11) 


Substituting into Eq. (9.9), we get 


3c f ue -1/4 
Taisk = | ————— ; ; 
iak (= = i) a Mgt (9.12) 


This shows that the temperature of the disk actually decreases with in- 
creasing mass of the black hole. As an example, consider a situation with 
Mpu œ 10°Mo, f ~ 1,7 = 0.1 and L ~ 1.5 x 1048 erg s~!. This requires 
an accretion rate given by Eq. (9.11) which translates to M ~ 2.6Mo yr7?. 
The corresponding temperature obtained from Eq. (9.12) is Taise = 7.3 x 10° 
K. Such a spectrum will peak around 400 A in the extreme UV band. This 
thermal] radiation could be responsible for a ‘bump’ seen in the UV spectra 
of many AGN. 

Further, if the radiation is in equipartition with a magnetic field strength 
Br, then we obtain a characteristic magnetic field of 


M -1/2 
2) 4 
Br ~4x10 (az) Gauss. (9.13) 
The lifetime of electrons emitting synchrotron radiation in such a magnetic 
field will be about 


cool Me —1 GM -1 M 
x | — —— ] 0. — : 9.14 
mie (m) (a) en (Game) * 00 


These numbers imply that we can expect significant radio luminosity due 
to synchrotron emission; in fact — if most of the luminosity L is due to 
synchrotron radiation — then the characteristic frequency will be v ~ 2 x 
10!4(M/108Mo)—5/14 Hz. But the synchrotron lifetime of the electrons will 
be fairly short and re-acceleration mechanisms are required for sustained 
emission. 

Compared to radio and optical, the origin of high energy spectra is more 
complex and is ill understood. Pair production processes as well as inverse 
Compton scattering could contribute to high energy radiation though the 
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details differ from source to source. It is easy to see from the following 
arguments that pair production has to be be important. The number n, 
of photons near the threshold energy for pair production, hy + mec?, can 
be estimated by dividing the energy density, U ~ L./4nR?c, of gamma-ray 
photons within a region of radius R by the threshold energy mec?. The 
optical depth for pair production is about Ty-y ~ nyorR and the pair 
production will be important if 7,_, Z 1. This condition can be stated in 
terms of the compactness parameter £ = (L+/R)(or/mec*) as 


= Borm > 4r. (9.15) 


Expressing the compactness parameter in terms of Eddington luminosity 
Lg, and Schwarzschild radius Rs, as £ = 2a(Ly/Le)(Rs/R)(Mp/mMe), the 
condition for pair production becomes 


Ly — 2Mme {f R _3f R 
— >= — | — ) xl. — }. 9.1 
igo mh (=) 1x10 (=) (9.16) 


Since R = 10Rg, it is possible to meet this condition in many AGN sug- 
gesting that pair production can be an important effect. 

Finally, note that the remnant mass in Eq. (9.4) was estimated assuming 
all bright (L ~ L) galaxies host an AGN. Alternatively, if only about 
1 per cent of the bright galaxies host the AGN, say, then the remnant 
mass has to be hundred times larger: Mrem © 2 x 10°h-3BMo. Since 
the total energy radiated by AGN can be expressed in terms of M, Meu 
and an efficiency factor, one can also estimate the remnant mass as Mrem © 
8x 1072-371 (tg/tayn)Mo where tg is the time interval for which the AGN 
is ‘on’ and tayn is the evolution time scale which will be of the order of the 
age of the universe. If tg % tayn and 7 = 0.1, then the remnant masses 
are large (Mrem © 10°h~370.1Mo). In this case, only a small fraction of 
the galaxies harbour AGN and (L/Le) < 0.05h70.1. On the other hand, if 
tg X tayn, then every bright galaxy could have a AGN with, say, Mrem © 
10°7h~8no.1Mo if tg © O.ltayn. In this case, (L/L) $ 3hn01. These 
arguments illustrate the interdependence of these estimates and highlight 
the implicit assumptions. 


9.2 Radio jets and bulk relativistic motion 


A sub class of AGN show bright radio lobes and jets connecting the central 
source to these lobes. Because of the high resolution and sensitivity which 
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can be achieved in radio band, it is often possible to do the imaging of 
the AGN with milli arc second resolution in this band and study the radio 
features in detail. 

In contrast to the optical/UV emission, the radio spectrum is non- 
thermal and is most likely to be synchrotron radiation from relativistic 
electrons. The synchrotron emissivity due to a power law distribution of 
electrons with N(E) œx E~® is (see Eq. (2.58)) 


€y x BEtD/2y1-2)/2 ay, (9.17) 


Hence a power law spectrum of electron will lead to a power law synchrotron 
emission with the two indices related by a = (1/2)(x — 1). This leads to a 
non-thermal, power law spectrum in the radio band for the AGN. 

For luminous radio jets moving with relativistic speeds, one also needs to 
consider several effects of special relativity to relate the observed quantities 
to the intrinsic properties of the source. Some of them can be counter 
intuitive and deserves a discussion. 

The first nontrivial effect due to relativistic motion is that the apparent 
proper motion of a radio jet in the sky could be at a speed greater than 
that of light. This arises as follows: Consider a blob of plasma which moves 
with speed v at an angle w to the line of sight. Two signals emitted by the 
blob in a time interval dte in the rest frame of the source will be detected 
during a time interval dtp in the rest frame of the observer. There are 
two effects which make dte # dto. First is the cosmological redshift which 
scales the time intervals by the factor (1 + z); it turns out, however, that 
the really interesting features do not arise from this factor and hence we 
shall temporarily ignore it. The second effect is due to the fact that the 
blob moves a distance vdte cosy in the direction of the observer between 
the emission of two signals so that the second signal has to travel less 
distance compared to the first. This reduction in the distance implies that 
the signals will arrive in a time interval (in units of c = 1) 


dto = (1 — v cos %) dte. (9.18) 


The distance traveled by the blob in the plane of the sky, transverse to the 
line of sight, is dl, = vdt.siny so that the apparent transverse velocity 
Vapp measured by an observer is 


dte _ vsin y 
dto 1—vcosy 





dl . 
Vapp = = = (vsiny) (9.19) 
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It is obvious that the apparent velocity (indicated with a subscript ‘app’) 
can be greater than unity even for large inclination angles provided y is 
high enough. 

The following results can be obtained directly from Eq. (9.19): (i) For 
a given value of v, the apparent velocity vapp is a maximum when y~ = 
cos-!y, with the maximum value being yv. (ii) In the limit of v > 1, 
maximum occurs for ~ œ (1/7). (iii) It also follows from this analysis that 
superluminal motion occurs at some angle provided vapp > 1 corresponding 
to v > (1/,/2). (iv) The same analysis can be repeated in terms of a fixed 
observed value for Vapp. The minimum speed v required to produce a given 
value of Vapp occurs at p = cot~!(vapp) with Umin = v/V1 + v2. (v) When 
cosmological redshift is taken into account, Eq. (9.18) needs to modified by 
a factor (1+ z) on the right hand side which does not change the condition 
for observing the superluminal motion. 

The relativistic motion also affects the flux of the source. If cosmological 
factors are ignored, then the constancy of I,,/v* implies (see Sec. 2.1.2) that 
the intensity (or surface brightness) of the source will scale as 


I, (Dvo) = D?I, (vo) (9.20) 
where 
D = TES 9.21 
~ (1 — vcosp) ot) 


is the standard special relativistic factor (see Sec. 2.3). For an unresolved, 
optically thin, source or for an optically thick spherically symmetric source, 
the flux F(v) transforms in the same way as the intensity. In particular, 
for a source with power law spectrum F(v) x v~%, we have the result 


F(v) = Dte F’ (v) (9.22) 


where the primed quantities are those in the rest frame. It is clear that 
there is boosting of the observed flux over the rest frame flux. When the 
cosmological redshift cannot be ignored, the observed frequency v and the 
rest frame v’ are related by 





(9.23) 





Fw) =( z ) "Fe. (9.24) 
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As an application, consider a source which emits two blobs of material 
moving in two diametrically opposite directions Y and m + with the same 
speed v. If Fin and Fout are the fluxes from the sources moving towards 
the observer and away from the observer, then we have 


(9.25) 


Fn _ /1+vcosy are 
Fat  \1l—vcose 


which shows that there is a preferential enhancement of fluxes in the com- 
ponent which is moving towards the observer. 

The results described above are valid for a single blob of material moving 
with a given velocity. In the case of a continuous jet formed out of a series 
of blobs, the number of blobs observed at any given instant itself scales as 
D-' and hence the index in the formulas for the flux change from (3 +a) to 
(2+ a). So the ratio of the flux arising from advancing jet to the identical 
receding jet will become 





Fin _ (eer) 


Fout l — vcosy (9.26) 


Such a relativistic beaming effect is often invoked to explain the fact that 
several observed radio sources show jet like structure only on one side. The 
idea is that the jet is intrinsically two sided but you only see one of them 
which has its flux enhanced due to this effect. If the ratio of the fluxes is 
R = Fn/Fout, we can obtain an upper limit on the angle of inclination to 
be 


R!/2+a) _ 4 


< Ri/@+e) 4 1° (9.27) 


The relativistic effects also play a role in determining the time scale of 
variability of the sources and hence in putting bounds on the size of the 
source. If the observed variability is in a time scale Atya,, then the intrinsic 
time scale of variability in the rest frame of the source is At, = DAtvar. 
The minimum source size corresponding to this variability time scale is 
R ~ cAti,, ~ cDAtyar. The solid angle subtended at the source is Q ~ 
(R/r)? ~ (cDAtyar/r)? where r is the distance to the source. The observed 
flux is then given by 


F(v) = Iv) (eee) i (9.28) 
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where I(v) is the intensity measured in the observer’s frame. We can now 
easily relate the brightness temperature T’ corresponding to a given inten- 
sity in the rest frame and the brightness temperature Tobs as determined in 
the observer’s frame corresponding to the angle 0 subtended by the source 
where Q = 62. Simple algebra now gives 


Tobs POT": (9.29) 


The physical origin of the D? factor can be understood as follows: The 
brightness temperature is related to the intensity by T œ (I,/v?)6?. If 
6 is treated as a fixed quantity, then T will scale as (I,/v?) = (IL/v*)v; 
since (I,,/v*) is invariant, this will bring in one factor of D. If we further 
estimate the size of the source and thus 6? by the time variability (rather 
than treat it as fixed), then we will get an additional D? factor from this, 
thereby producing the D? factor. This shows that there is an enhancement 
of the brightness temperature due to relativistic beaming. 

Since relativistic beaming amplifies the observed luminosity of the 
source, it will also change the luminosity function of the source. Consider, 
for example, a class of sources all having the same luminosity l but are 
moving relativistically over randomly distributed directions. The observed 
luminosity L, for a given angle w, is then given by 


1 
mm Eyen i o 

L =D", D = a Aco e) (9.30) 
where m = 2+ a for a continuous jet and m = 3+ for a blob. As the 
angle 7 varies from 0 to (7/2), the observed luminosity varies in the range 
[(2y)~™l, (2y)™1]. For a collection of sources with same y but randomly 
oriented beaming directions, the probability distribution for D is given by 
P(D)dD = sin Ydy; from this it is easy to work out the conditional proba- 
bility of observing a luminosity L for a source with luminosity | to be: 


1 
P(L|l) dl = (=) ha ie AEE el (9.31) 


If the intrinsic luminosity itself is distributed according to some luminosity 
function ®ini(l), then the observed luminosity function is given by 


Bore(L) = Í dl P(L\l) Ging (2). (9.32) 


The limits of integration will be fixed by the cut-offs on the intrinsic lu- 
minosity function as well as the range of L and y. If ®int(1) = KI" with 
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q > 1,l > min, then the observed luminosity function is given by 


K —(m+1)/ eee —q+1/ 
eee (=) L m f dime (9.33) 


where lı = max (lmin, L(2y)7™). That is, 


L-1 (for L > (2y)"™lmin) 


9.34 
L™+1)/m (for L< (27)™lmin)- ee) 


Pobs(L) x 


The physical reason for this scaling is simple: At high luminosities a whole 
range of l contributes to obs thereby maintaining the original shape, while 
for low L, the contribution comes mainly from the sources near Imin. The 
effect of beaming is thus to produce a power law luminosity function with 
a break which flattens the intrinsic luminosity function at low luminosities. 


9.3 Quasar luminosity function 


We shall hereafter concentrate on a key subclass of AGN, called quasars. 
The simplest statistical description of the abundance of quasars — or for 
that matter any other source of a particular kind —~ is by the source 
count which measures the number of sources as a function of their bright- 
ness. To see the importance of source count, let us consider a survey in 
which the apparent magnitude of all the sources brighter than a limit- 
ing magnitude mmax are measured in a given patch of the sky having a 
solid angle AQ. Translating the magnitudes into fluxes using the rela- 
tion F = Fy10°4(™°-™) we can determine the number of sources N(> F) 
brighter than a given flux which exists in the survey. If the sources are 
distributed in a spherically symmetric manner with a luminosity function 
dn/dL (which is independent of distance), then it is easy to find an expres- 
sion for N(> F) in terms of (dn/dL). We note that, for each luminosity L 
there exists a limiting luminosity distance given by dz (L) = (L/4nF)'/?. 
Hence, the number of sources which we can see with flux greater than F 
will be 
dn 
N(> F) = J dL Vidi (L, F) £ (9.35) 
where V [dz (L, F’)| is the volume within which a source of luminosity L could 
be seen with a minimum flux of L. In a Euclidean universe V = (AQ/3)d3; 
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Neuci(> F) = cama” if dL Ae. (9.36) 
Thus, for a spherically symmetric homogeneous population in Euclidean 
universe, N(> F) « F~3/? independent of the shape of the luminosity 
function (dn/dL). An important conclusion is that source counts of such a 
population in the Euclidean universe cannot be used to obtain any infor- 
mation about the shape of the luminosity function. 

Since we expect quasars to evolve over cosmic time scales, the luminosity 
function of quasars will depend on the redshift as well. Observations suggest 
that this luminosity function can be expressed in the form 


(L, z) dL = ¢" l (a5) 2 (ea) } TI (9.37) 


Using the relationships 


Mp = Mj(z) — 2.5 log (a) 


dL 
dMpg 


this can be expressed in a more convenient form in terms of the blue mag- 
nitudes as follows: 


ọ¢(MpB,z)\dMpg = ġ(L,z) 





dM p (9.38) 





ġ* dMp 


(Mz, z)dMg = 100-4IMs—-M3(2)(a+1) 4 190-41 Ma—M3 (2)] (641) (9.39) 
where 
A — 2.5k log(1 fi < Zmax 
M%(z) = pelt fe); Morz < zias) (9.40) 
M$ (Zmax) (for z > Zmax) 
with 
a=-3.9, B=-15, A=—20.9+5logh, k= 3.45, 
Zmax = 1.9, $* =5.2 x 10°h? Gpc?mag™?. (9.41) 


This function gives the number density of quasars with magnitudes between 
Me and Mg +dMpg at a redshift z and is plotted in Fig. 9.1 for different 
redshifts. [The inverse relation is based on L = Lo10%4Mo-M] with Mo = 
5.48, Lo = 2.36 x 10%% erg s~1.] In spite of significant amount of efforts 
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to understand the luminosity function of quasars at z > 3, the situation 
remains very inconclusive. 
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Fig. 9.1 Quasar luminosity function at different redshifts. 


One obvious feature of Fig. 9.1 is that the luminosity function of quasars 
does evolve in time though the actual nature of evolution is not very easy 
to determine. If the shape of the intrinsic luminosity function of quasars 
did not change with time but the comoving number density varied with z, 
then the luminosity function will move vertically in Fig. 9.1 (This is called 
pure density evolution.) The other extreme situation corresponds to the 
assumption that the space density of quasars is constant in time but the 
luminosity of the population changes with redshift (called pure luminosity 
evolution). In both the cases, one can explain the absence of very high 
luminosity quasars in the local region: In the case of density evolution, this 
is because they are spatially very rare; in the case of luminosity evolution, 
this occurs because the sources are much fainter today than they were, say, 
at z = 2. 

There is no physical basis to expect that either extreme assumption 
should hold and they merely bracket a range of more complicated possibili- 
ties. Nevertheless, it is possible to work out some physical consequences of 
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these assumptions which are illuminating. Let us begin with pure density 
evolution. Since there are fewer quasars now than in the past in this sce- 
nario, some of the quasars in the past must have turned off and the dead 
remnants of quasars must reside at the centers of galaxies, which appear to 
be normal at present. If we define the ‘bright’ galaxies to be those with, 
say, L > 0.1L*, then, from the galaxy luminosity function (see Sec. 8.5) we 
get the space density of bright galaxies to be 


oo / r, \ -107 =, L 
n@(L > 0.1L") = ¢% a (=) exp (==) d (=) 


~ 5.7 x 107h3 Gpc™?. (9.42) 


The accreted mass per dead quasar can be obtained by dividing the accu- 
mulated mass density of Eq. (9.3) by the space density of host galaxies: 


2x 10!857-!Mo Gpe? 


Mauasar = 53073 Gpo? 


~ 3.5 x 10°h3n7 Mo. (9.43) 
While this is not very high, it is at a ball-park value that should lead to 
observable effects in the nearby galaxies. 

The space density of quasars today can be determined from the lumi- 
nosity function in Eq. (9.37) and is about 1.1 x 106A? Gpce-3. Comparison 
with Eq. (9.42) shows that the fraction of bright galaxies containing ac- 
tive quasars today is about f ~ 0.019. If all the bright galaxies contained 
quasars then the density of quasars would have been higher by a factor 50 or 
so. If pure density evolution is valid, then a vertical downward translation 
by factor 50 of the z = 2 curve in Fig. 9.1 should make it coincide with the 
z = 0 curve. Such a translation, however, seem to over predict the number 
of high luminosity quasars at low redshifts. For example, calculating the 
number density of quasars brighter than Mg = —23 + 5logh at z = 2 and 
rescaling by a factor 50 will lead to a local space density of about 440h° 
Gpc~3. In a sky survey, extending up to z = 2 and covering about 25 per 
cent of the entire sky, one would expect to find about 100 such objects while 
only about 8 were found in such surveys. It is obvious that pure density 
evolution will not work. This is also, of course, clear from the fact that 
the break point in the luminosity function does evolve with z at least for 
z< 1.9. 

Another important application of the observed abundance of quasars is 
to check whether it is consistent with standard structure formation mod- 
els. Though the results are not as discriminatory as those based on the 
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abundance of clusters which we discussed in Sec. 6.4 (see Fig. 6.4), it is still 
interesting from the point of view that it probes much higher redshifts. To 
obtain such a constraint, we begin by noting that to power the quasars with 
z > 4 having typical luminosities of 10*’ergs~!, in a universe with Q = 1 
and h = 0.5 we require black holes of mass Mgy œ 10°Mo if L ~ Lg. 
From the luminosity one can also estimate the amount of fuel that must 
be present to power the quasar for a lifetime tg. If € is the efficiency with 
which the rest mass energy of the fuel is converted into radiation, then the 
fuel mass is 


Lto 9 L tg e\71 
ee eo ike (comes) (i oD ey G2 


So if the lifetime is about 108yr, and the efficiency is about ten percent, 
the required fuel mass is comparable to the mass of the central black hole. 

The mass estimated above corresponds to that involved in the central 
engine of the quasar. This mass will, in general, be a small fraction F', of 
the mass of the host galaxy. We can write F as a product of three factors: 
a fraction f, of matter in the universe which is baryonic; some fraction fret 
of the baryons originally associated with the galaxy, which was retained 
when the galaxy was formed (the remaining mass could be expelled via a 
supernova - driven wind); a fraction fhole of the baryons retained, which 
participates in the collapse to form the compact central object. All these 
factors are model dependent. A very optimistic estimate for F = fe fret fhole 
will be F ~ 0.01; a value like F ~ 107° is more likely. We then get, for the 
mass of the host galaxy 


11 L tg € ~1 F T 9.45 
EEA (orega) (5) Ga (ar) aA 


For (F/0.01) ~ (0.1—1), with the earlier value being more likely, Eq. (9.45) 
implies a mass for the host galaxy in the range of (1011 — 10!*)Mo, if other 
dimensionless parameters in Eq. (9.44) are of order unity. Therefore the 
existence of quasars at z > 4 suggests that reasonable number of objects 
with galactic masses should have formed before this redshift. 

An estimate of the fraction of the mass density, fcoun(> Ma, z), in the 
universe which should have collapsed into objects with mass greater than 
Mg at redshift z can be made using the luminosity function of quasars. We 
have 


M(M) 
p(z) feon(> M, z) _ J TMhost(M)o(M, z)dM. (9.46) 


—co 
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In the right hand side, M is the bolometric magnitude of a quasar situated 
in a host galaxy with a total mass M; (M, z)dM gives the number density 
of quasars in the magnitude interval dM. So the product Mhost(M)¢dM 
gives the collapsed mass density around quasars with magnitude in the 
range (M,M + dM). We integrate this expression up to some limiting 
magnitude M corresponding to a host mass M. Since magnitude decreases 
with increasing luminosity, this picks out objects with luminosity higher 
than some particular luminosity or — equivalently — mass higher than M. 
Finally the factor 


T = max (2. 1) (9.47) 


takes into account the possibility that the typical lifetime of a quasar tg 
may be shorter than the age of the universe at the redshift z. In that 
case, typically (¢(z)/tg) generations of quasars can exist at the particular 
redshift and we need to enhance the abundance by this factor. Given the 
luminosity function of the quasars and the relations derived above, we can 
calculate the fraction by doing the integral numerically. One finds that 


fcon (> 10!°Mo, z 2 2) = 1.4 x 107%. (9.48) 


To see what this result means, we compare this with the theoretical model 
developed in Chapter 6 which predicts fco- A comparison of Eq. (6.76) 
and Eq. (9.48) shows that at the scales corresponding to the host mass 
of quasars (which could be above 101? Mọ), we must have ø ~ 1.4 to 2.6. 
Any theoretical model for structure formation should produce at least these 
values of ø to be viable. 


9.4 Neutral hydrogen in the intergalactic medium 


Even though quasar spectra do not show any absorption due to smoothly 
distributed neutral hydrogen in the IGM (see Sec. 7.3), it does exhibit 
several other absorption features due to hydrogen. We shall first describe 
some of the broad features seen in a typical quasar spectrum and will then 
describe the absorption systems in more detail. 

Numerous absorption lines are seen in a typical quasar spectrum at 
wavelengths shortward of the Lyman—a emission line. These can be inter- 
preted as Lyman—qa absorption lines arising from clumps of gas along the 
line-of-sight to the quasar with each absorption line being contributed by a 
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particular region of IGM along the line of sight to the quasar that contains 
neutral hydrogen. The key feature which allows one to distinguish differ- 
ent Lyman-a absorption systems is the column density of neutral hydrogen 
present in each of them, which — in turn — can be determined by fitting 
the observed absorption line to a theoretically known profile (called Voigt 
profile which we discussed in Sec. 2.5.4). You will recall that Voigt profile 
is a convolution of natural line width and thermal line width through the 
formula (see Eq. (2.116)) 


7(A) = (1.498 x 107?) (2) H(a,u) (9.49) 
where 
ee, Se el ing 
ae = (uy rae "= rbd “53 (+ 3 7 5] l 
(9.50) 


The core of the line is dominated by the Gaussian profile arising from 
thermal width while the wings are dominated by the Lorentzian profile 
arising from the natural width. The optical depth at the center of the line 
is about 


is N(cm~) fro(A) 


= 1.497 x 107 
Y b(km s7?) 


(9.51) 


and essentially determines the nature of the absorption line profile which 
is observed. 
Since the photoionisation cross section for hydrogen is about og ~ 107!” 


2, a column density of N ~ op 1 provides a natural limit for photoion- 
2 


cm 
isation shielding. Lines with column densities between N ~ 10!" cm7 
and N ~ 107° cm~? are usually called Lyman limit systems. Systems with 
low column density — usually below about N ~ 10!” cm~? — are called 
Lyman-a forest. The term ‘forest’ is used to denote large number of such 
lines which are typically seen in such absorption systems. 

The actual number of absorbing systems seen in the IGM with different 
column densities of hydrogen, seem to follow a fairly simple power law over 
a wide range. This result can be expressed in a more useful form in terms 
of the probability dP that a random line of sight intercepts a cloud with a 
column density in the range (X, X + d&) at the redshift interval (z, z + dz). 
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Observations suggest that this probability is given by 


£ —8 
dP = g (£, z) dEdz ~ 91.2 (5) = 
0 


= dz, (9.52) 


with Do œ 10'4cm~?, 8 & 1.46. 


9.4.1 Lyman forest 


We shall now consider the physical properties of the absorption systems 
beginning from the Lyman-a forest which has the lowest column densities. 
These low column density systems could arise naturally from the fluctua- 
tions in the neutral hydrogen density in the IGM in the following manner. 
Consider a region of diffuse low density IGM in which the matter is in ion- 
isation equilibrium under the action of an ionizing flux J, (from different 
sources like quasars etc.) and recombination. Since the density contrast in 
the IGM will be fluctuating and evolving, we will obtain transient absorp- 
tion systems due to all the low column density neutral hydrogen regions 
along the line of sight to a quasar. The neutral hydrogen fraction in these 
systems can be estimated exactly as in Eq. (7.34) and we will get 


nH _ afny 3 19-6 tiem? (1 + 2)? ( T 


nm OR Jo 104K 





yo [1 + ô(2)]?. (9.53) 


This is same as Eq. (7.34) except for two modifications. We have introduced 
a factor [1 + 6(z)]? to take into account the density variations — 6 is the 
density contrast compared to the background density — in the IGM; we 
have changed the exponent of temperature dependence from —(1/2) to a 
more accurate value of —0.76. The neutral hydrogen density in Eq. (9.53) 
will correspond to an optical depth of 


a\2 ai 6 —0.76 
T(z) = 6.5 x 1074 (Ga) (se) u p dn() 4 — h + 6(z)]?. 
(9.54) 


where Rig = (R/107!? s7!) is the photoionisation rate defined in 
Eq. (7.32). This equation gives the typical opacity due to Lyman-a for- 
est in diffuse intergalactic medium. 

The amount of mass density contributed by these absorption systems 
is of considerable interest and — fortunately — can be estimated without 
knowing the size of the neutral hydrogen patch involved in the absorption. 
Consider a region of IGM from which Lyman-a forest absorption takes 
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place. Let us assume that the typical overdense fluctuation has a size of I, 
neutral hydrogen mass My, column density in the range (£, © + dD) and 
let the number density of overdense regions is n(z). Then we have 


dN 
dXdz 


cdt 


dd = g(Z,z)dd ~ ml?n(z) F 
z 


(9.55) 
and 
My ~ mprl? E. (9.56) 


The total density contributed by neutral hydrogen in these clouds is 


pu = n(z)My = [r= | [mpri E] = mpXg(¥, z) (=) dd. 
(9.57) 
Note that this result is independent of the size | of the clouds. The total py 
contributed by clouds with column densities in the range (Emin, Umax) can 
be obtained by integrating this expression between the two limits. Dividing 
by pc(1 + z)? one can find the Ny contributed by the neutral hydrogen in 
these clouds: 


m dz\ [Ba 
Q ESA N S 
a2) = +a (SE) Ve ee 


m Apo 1/2 1 Umax 
E (=)( 2 ) urg L Sg(B,z)dd. (9.58) 


Taking g = 91.2(£/2o)-fXz' with Dp = 1014 cm~?, 6 = 1.46, Emax © 
10?2cm~*, Emin © 10!4cm7?, we get, at z = 3, 


Sie ye 


1022 cm~? 





Qy ~2x 1070h! ( (9.59) 
For Emax % 1015 cm~?, min & 10! cm~?, which is appropriate for Lyman 
forest clouds, the corresponding result is 


Qy (Lyman forest) ~ 3 x 10-70N2h7} (9.60) 


at z = 3. 

The Qyr contributed by the ionized component in these systems can be 
much larger. In a cloud with column density X, let the plasma contributes 
a mass which is F'~! larger than that of neutral hydrogen. This fraction F 
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can be determined from Eq. (7.33) rewritten in the form 


Rnyz\? [REN 
Sf) eee SPa .61 
Np ( a ) al ) (9.61) 
where | is the characteristic scale of the system. This gives 
1/2 
NH anp ad 
F = = = — = | — i 9.62 
Np R ( Rl ) op?) 
So the dQgary due to clouds with column density in the range (X, © + dd) 
will be 
dQy [RIN (dQy 
SS —— pia, E . . 3 
dnar = ~r (5) ( e (9.63) 


The result in Eq. (9.58) is now modified to 


RI\ (mp HN? 1 Ses a 
ee ( q ) ( Pe ) ( c (1+ z)1/2 m E g(2, z)dz 


(9.64) 
Since the exponent 8 of g x EÊ is close to 1.5, the integral is close to a 
logarithm. Taking 8 = 1.5 for simplicity, we get, at z = 3, 


Jl T N 2 
re -3 040 max 
OBary z 6.3 x 10 ( h3 ) (aK z) In (==) . (9.65) 


If we take Emax = 10'5cem~? and Emin = 10!3cm~? the logarithm is about 
4.6 and we find that 


TONY T N~ 
2 —2 { Joto 
QBary =3x 10 ( he ) (sx) : (9.66) 











Finally, we note a feature about these systems which is potentially 
very important. Observations reveal that approximately 50 per cent of 
the Lyman-a lines with column densities higher than 3 x 10!4 cm~? and 
nearly all lines with column densities higher than 10! cm~? originate from 
regions which also show absorption due to carbon. While this shows that 
stellar nucleosynthesis has already distributed the metals in the IGM at 
about z = 3, the actual abundance of metals is difficult to determine. Es- 
timates center around Z = 107?-5Zo with a large — about an order of 
magnitude — scatter around this value. The metallicity of IGM at high 
redshifts, when determined more accurately, can impose strong constraints 
on the structure formation models. 
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9.4.2 Damped Lyman alpha clouds 


Historically, damped Lyman-a clouds (DLA, hereafter) were defined! as 
systems with hydrogen column densities higher than 2 x 10?° cm~?. The 
high column densities make the optical depth of these clouds sufficiently 
large for the hydrogen to be mostly neutral. The gas has a temperature 
T <3 x 10° K and could even contain molecules. The distribution of col- 
umn densities follows a slope of —1.5 even at these high column densities 
but there seems to be virtually no evolution in these systems. The num- 
ber density of these systems per unit redshift interval does not show any 
significant z dependence and is given by (dN/dz) ~ 0.3 around z ~ 2.5. 
The cosmic density of neutral hydrogen in DLA at z % 3 is comparable 
to that of stars today. All these suggests that DLAs can be thought of as 
individual clouds (unlike the low column density Lyman-a forest which is 
most likely to have originated from a fluctuating IGM). 

To begin with, (dN/dz) = B = 0.3, implies that the mean proper 
distance between the clouds along the line of sight (in a Qnr œ 1 universe) 
is about 


_ cdt dz _ c ~ 104-1 -5/2 
“dG dN BUF E O AEAN Mpe 
~ 3007} Mpc (at z = 3). (9.67) 


If we take the size of the cloud to be l = 10lgh~! Mpc, then the density 
of hydrogen atoms and the mass of neutral hydrogen in these systems are 
given by 


x 
nH œ T = 0.3hIg £22 cm7? 


My © mpl? ~ 10° Do2h-?Mo (9.68) 


where Xv is the column density in units of 10?? cm~?. The mass in neutral 
hydrogen is fairly high allowing these systems to be interpreted as pregen- 
itors of the discs. 


lYou may wonder what is “damped” about these systems. The terminology, as usual, 
has a complicated origin. We saw in Sec. 2.5.4 that the profile of an absorption line will 
be dominated by the Lorentzian wing at high column densities. This Lorentzian profile, 
in turn, can be modeled by a damped harmonic oscillator such that the natural line 
width arises from the damping of the oscillator; see Eq. (2.103). Since these systems 
with high column densities probe the natural width of the absorption line, the term 
“damped” has got affixed to them! 
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The fraction, f, of the space filled by these clouds is also of interest and 
can be estimated as follows. If n(z) is the number density of clouds, then 


x = (ni?) n(2) (9.69) 


The fraction of space filled clouds in some large volume V is 

f= (47/3) n(z)V _ 4r 
V 383 

Substituting for n(z) from Eq. (9.69) we get 


(2) OOOH e 


where L is the mean distance between the clouds along the line of sight. 
Numerically, using Eq. (9.67) we get: 


n(z)l?. (9.70) 


l 
f= 7533x 105l (at z= 3). (9.72) 
The number density of clouds will be 
Mcloud = f = 33hĉl7? Mpc? (9.73) 


with a mean inter-cloud separation of naa x 0.3h7 12/ 3 Mpc. This is 
nearly 30 times larger than the size of damped Lyman alpha systems if 
lo zi. 

The high column density of these clouds imply that they will be mostly 
neutral. The metagalactic flux of photons will be able to ionize only a 
thin layer of matter on the surface. Near the surface, equilibrium between 
ionisation and recombination (see Eq. (7.33)) requires 

NH  ANpy 
— =- 9.74 
Ma (9.74) 


leading to a proton density of 


R 1/2 RY 
Pe Nar: SNE 


ae p \i4 s 1/2 Joh 1/2 e ta 
~ 2 \ IOK 102cm? lo Say i 


which is about six times the neutral hydrogen density of ny of Eq. (9.68). 
Further, the penetration length À of photons at such high column densities 
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is low; we have, 





Rees = 0.14lgh—! pe soe =: 4 (9.76) 
= nyg eo ? p 1022 cm-2 : i 


This length is much smaller than the size of the system. (Note that 
(A/l) œ (Eor) !; so A ~ l at column densities of 5 = 10! cm~?. For 
E > 10!" cm7?, \ < l and the system is mostly neutral — which is, of 
course, what you would have expected.) Thus, most of the mass in damped 
Lyman alpha systems is contributed by the neutral hydrogen: 


MBaryons *¥ My © 10°°h-*Mo. (9.77) 


The Q contributed by damped Lyman alpha systems is quite large and 
comparable to that of luminous matter in the galaxies. We have, from 
Eq. (9.59), 


Qe 2x10-8h-) (for E = 10"em~?; z & 3). (9.78) 


The dark matter associated with such a cloud will be higher by a factor 
Q,'. If Qg ~ 0.06, this factor will be about 18. Thus the total Qnr 
associated with damped Lyman—a systems could be about (0.05 — 0.1) in 
the redshift range of z = 2 — 3. 


Exercises 


Exercise 9.1 Adding up Planck spectra: 


The analysis in Sec. 3.3.2 (see Eq. (3.54)) showed that the temperature distribu- 
tion of an accretion disk varies as T x R~*/*, Assume that the radiation from 
an AGN arises from such an accretion disk. Then the total intensity which we 
observe from an accretion disk at any given frequency v is an integral over the 
entire disc. Assuming that each circular patch of the disk emits as a blackbody at 
the temperature T(R), one can estimate the net intensity at any frequency v by 
a direct integration. Show that the intensity varies as I, œ [v?, v1/?, exp(—v/vo)] 
for low, intermediate and high frequencies. 


Exercise 9.2 Pair production and compactness: 


Radio observations of an AGN at a distance of 200 Mpc shows that at v = 1 GHz, 
the brightness is 1 mJy. The source is seen to vary at a time scale of 1 week. (a) 
Calculate the brightness temperature. (b) Assume that matter and radiation are 
in thermal equilibrium at this brightness temperature and compute the optical 
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depth for pair production taking the relevant cross section to be ayy, ~ 107” 


cm? . 


Exercise 9.3 CMB and the inverse Compton scattering: 


Suppose the relativistic electrons are injected into a hot spot of a radio source with 
a power law spectrum y~”. The electrons stay in the hot spot for some fixed time 
T before being advected away. These electrons will lose energy through inverse 
Compton scattering off the CMBR. Show that in steady state the spectrum of 
relativistic electrons will change from y~? to y7?+t® at a critical y given by 
Vo (mec? /orcUcmsT) where Ucme is the energy density of the CMBR. Further 
show that this will lead to a break in the synchrotron radiation at a frequency vp 
which varies as v « (1+ 2)7°. 


Exercise 9.4 Getting the numbers: 


(a) Consider a source at redshift z, moving across the sky with the angular proper 
motion y = dĝ/dto milliarcsec per year, in a Qur = 1 universe. Show that 


Vapp —1 H 1 
= — 94.4h eae he e : 
c (; milliarcsecond yr~! ) ( vi + =) om) 


(b) Assume that a source emits blobs of matter with speed v in a random direction. _ 
Show that the probability that the apparent transverse speed for the source will 
exceed a value Vapp is given by 


1 v2 
p(> Vapp) = T+ 2p 1- aci (9.80) 


Exercise 9.5 Numerical estimates in variability: 


The brightness temperature, Tp, for a source with brightness S at wavelength 
A and subtending an angle 0 is given by Tà ~ à? S/(2kB0?). Suppose the flux 
changes by an amount AS (in Jy), in a time scale 7 (in years), show that 


T, > (2 x 10’? K) 2, (+) (9.81) 


if we ignore relativistic effects. Estimate T, for the two sources 3C 454.3 (AS ~ 5 
Jy, T =~ 1 yr, A œ 74 cm) and AO 0235 (AS œ 1 Jy, r œ 30 days, à œ~ 11 cm). 
What is the range of Lorentz factor y and direction of motion 6 (with respect to 
the line of sight) for which relativistic effects can bring down the actual brightness 
temperature to 10!? K? 
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Exercise 9.6 Jet scaling: 


Consider a jet with constant opening angle and assume that the surface bright- 
ness (luminosity per unit projected surface area) scales as r~*> where r is the 
distance along the jet. (a) Determine the scalings for (i) the equipartition pres- 
sure in the jet, (ii) equipartition magnetic field strength and (iii) lifetime due 
to synchrotron cooling. (b) Suppose the jet has a constant bulk Lorentz factor 
and that the synchrotron radiation is due to relativistic electrons with energy 
distribution N(y) « y~?. Show that if the electron gas expanded rapidly and the 
jet was dominated by a transverse magnetic field, then the surface brightness will 
scale as r~ 19/6, 
|Answer: (a) Constant opening angle 0 = R/r implies r x R. The volume 

emissivity is then e œ IL/R œ r~*°, The volume emissivity in synchrotron 
radiation scales as €, « ne(y?B?/v) ~ pe(B?y/v) where pe ~ neyme is the 
pressure of the electron gas. ume that most of the radiation is at v ~ ve x 

+B, we get év X peB?/ vz Y? œ Bu?ve'/? if pe x B2, under the assumption 
of equipartition. Comparing e « r~?> with vi es œ B3°, we find that the 
equipartition field Beg x r7} and the equipartition pressure scales as r~?. The 
synchrotron cooling time varies as tooo! = E/E œ y/7°B? « v7? B-3/? q& 73/2, 
(b) For a relativistic gas expanding adiabatically, p x p% x R-8/3_ Equating 
this to pe x ney x y7(dn/dy), we get y?dn/dy x R78/3. The emissivity varies 
as ey œ (y dn/dy)B?/ 2v7"? œ B3/2y-1/2R-8/3_ Using the result B œ R`}, 
we get ep x v7? R?5/6, The corresponding surface brightness is I, « e R œ 
v12 R-19/6 x r71, Since the observed distribution scales as I, œ r725, there 
must be a process of continuous acceleration to restore the energy lost due to 
adiabatic expansion.] 


Exercise 9.7 K correction: 


While observing sources like quasars, with redshifts comparable (or larger) to 
unity, it is necessary to take into account two simple kinematic effects: (i) The 
frequency at which the radiation is detected is not the same as the frequency at 
which it is emitted; (ii) the bandwidth over which the observations are carried 
out will not be the same as the bandwidth over which the radiation is emitted 
due to redshift. It is conventional to write the apparent magnitude of the object 
in the form Mintrinsic = Mobservea — K(z), where K(z) is called the K-correction. 
(a) Show that K(z) = Kı + Ke where K = 2.5log(1+ z) and 


i F(A) S(d) dd 


Ko =25log 2 
i F(A/[1 + z]) S(A) aa 


(9.82) 


where F(A) is the spectral energy distribution of the source and S(A) is the filter 
function determining the wave band over which observations are carried out. (b) 
Consider a power law source with (F « v~% and show that, in this case, the total 
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K-correction is given by 


K(z) = 2.5(a — 1) log(1 + 2). (9.83) 


Exercise 9.8 Growing a monster: 


Consider a black hole which grows by accreting matter and radiating at Eddington 
luminosity with the efficiency n = 0.1. If it starts at 10 Mo at the redshift z;, at 
what redshift will it grow to 10°Mo ? Alternatively, assume that the black hole 
should have grown to this value by, say, z = 5. What constraint does it put on 
the initial epoch? Does this sound reasonable? 


Exercise 9.9 Galactic center: 


Several pieces of evidence suggests that there is a supermassive black hole at 
the center of our galaxy. Suppose it had been active. How will the spectrum of 
radiation from this region look like? Assume that it has M = 10°Mo, L = Leaa 
and a radiation spectrum which is v~!/3 between v = 10’? and 10?! Hz. Use this 
to compute the apparent V-band magnitude as measured from Earth. 


Exercise 9.10 Iron line: 


The FeKa line representing an n = 2 — 1 transition in iron is a favourite of 
x-ray astronomers. Based on the temperature scaling, what is the mass range of 
the compact objects in which you will expect to see this line — in XRBs or in 
AGN ?[Hint: Remember that we want n = 2 to be reasonably populated but the 
ionisation fraction should not be too high.] 


Exercise 9.11 Superluminal motion: 


In the quasar 3C 273 at z = 0.158, one component is seen to move away from 
the nucleus with an angular velocity 2.2 x 107° arcsec yr~’. (a) What is the 
projected linear velocity of this component? (b) Assuming that the superluminal 
motion which you found in part (a) above is due to the effect discussed in Sec. 9.2, 
estimate the angle between the line-of-sight and the direction of ejected matter 
that will produce the maximum apparent velocity for the ejected component. 
(c) What is the minimum ejection velocity required to produce the apparent 
superluminal speed you obtained in (a)? 


Exercise 9.12 Real life astronomy: Quasars: 


The SDSS web-site has a good navigation tool (http://skyserver.sdss.org) which 
allows you to explore the objects detected in these surveys. Try, as an example, 
the coordinates around RA = 176.764° and DEC = 1.412°. There is a quasar in 
the field which you can click on and look at its spectrum (does it look like a quasar 
spectrum?) and its redshift. (a) Given the redshift and apparent R magnitude, 
find the absolute luminosity using the correct luminosity distance formula. Use 


Chapter 9: Active Galaxies 345 


two cosmological models, one with Qnr = 1 and the other with Qnr = 0.3 and 
Qy = 0.7. How much does the estimated luminosity change between these two 
models? (b) If the quasar is emitting in the Eddington limit, determine the 
mass of the central black hole for reasonable efficiencies. (c) The spectrum you 
obtained does not cover all the electromagnetic bands. Often one would like to 
know whether the same object emits, say, in radio or in x-rays. Try the VLA 
FIRST survey (check the site http://sundog.stsci.edu) at these coordinates to 
determine whether this quasar has a radio flux. How will you check its X-ray 
properties ? 


Notes and References 
Some of the specialist textbooks dealing with AGN are: Peterson (1997), Frank 


King and Raine (1992), Osterbrock (1989), Ferland and Baldwin (1999); also 
see chapter 8 of Padmanabhan (2002). 
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Appendix 


Range of Physical Quantities in 
Astrophysics 


This Appendix summarizes different physical parameters we encounter in 
the study of astrophysics — like mass, length, time, angular scales, velocity, 
energy, power, energy density, temperature, pressure and magnetic field. 
The numbers are approximate and are intended to give an idea of the 
range. In each of the tables, the values are given in two columns: the first 
column gives it in a fundamental unit while the second column gives a more 
appropriate astronomical unit when it iss relevant. The fundamental unit 
used in each of the tables is indicated in the caption. 


Mass (gm) 
Electron mass 9.11 x 10728 
Proton mass 1.67 x 10-4 
Planck mass((fic/G)!/2 2.2 x 1075 
Mass of Moon 7.3 x 10% (1/80)Me 
Mass of Earth 5.98 x 10?7 1Me 
Mass of Jupiter 1.9 x 1030 10-3 Mo 
Mass of the Sun 1.99 x 1038 1 Mo 
Chandrasekhar mass 2.8 x 1033 1.4 Mo 
Mass of interstellar clouds 2 x 10°" 10? Mo 
Mass of a globular cluster 1 x 10°9 5x 10° Mo 
Mass of Milky Way 2.6 x 1045 1.4 x 10!? Mo 
2.7 x 1048 1.3 x 1015 Mo 


Virial mass of the Coma cluster 
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Length (cm) 


Planck length(Gh/c?)2 

Proton Compton wavelength(h/mpc) 
Classical electron radius(e?/mec?) 
Electron Compton wavelength(h/m.c) 
Bohr radius(h?/mee?) 

Molecular mean free path in the atmosphere 
Optical photon wavelength 

Neutron star radius 

Radius of the Earth 

Radius of Jupiter 

Distance to the Moon 

Radius of the Sun 

Earth-Sun mean distance 

Pluto-Sun mean distance 

1 light-year (ly) 

1 parsec (pc) 

Nearest star to Sun 

Radius of Crab Nebula 

Radius of globular cluster 

Scale height of the Milky Way disk 
Distance to the Crab Nebula 
Distance from Sun to galactic center 
Distance to the LMC 

Length of a radio jet 

Distance to Andromeda 

Distance to Coma cluster 


1.6 x 10733 

1.3 x 10738 

2.8 x 10738 

2.4 x 10710 

5.3 x 1079 0.5 Å 
7.0 x 1078 700 A 
5x 1075 5000 A 
106 

6.3 x 108 1Re 
7.1 x 10° 11.3Re 
3.8 x 101° 60Re 
7.0 x 1010 1Ro 


1.50x 103 1 AU 
5.91 x 10!4 39.4 AU 
9.46 x 1017 

3.08 x 101% 3.26 ly 


4 x 1018 

4.5 x 1028 
1.5 x 1020 
2 x 102! 

6.6 x 107! 
2.4 x 102? 
1.5 x 1023 
1 x 1074 

1.9 x 1074 
4.1 x 1026 


Time (seconds) 


Minimum possible spin period for a 
neutron star 

Spin period of pulsar PSR 1957+20 

Sun’s free fall time scale (1/./Gp) 

Earth rotation period 

Earth orbital period 

Main sequence lifetime for a 30 Mo star 


8 x 1074 
1.6 x 1073 
2000 

8.6 x 104 
3.2 x 107 
1.6 x 1014 


1.3 pe 
1.46 pe 
50 pe 
700 pc 
2.2 kpc 
8 kpe 
50 kpc 
300 kpc 
620 kpc 
132 Mpc 


33 min 

24 hrs 

l yr 

5.0 x 108 yr 


Appendiz: Range of Physical Quantities in Astrophysics 349 


Sun’s thermal time scale (GMo/R Lo) 
Main sequence lifetime for a 5 Mo star 
Orbital time of sun in the galaxy 

Age of Earth and Sun 

Main sequence lifetime for Sun 

Age of an old globular cluster 

Age of the universe 


6.3 x 1014 
1.9 x 1015 
7.3 x 1015 
1.5 x 101” 
3 x 1017 

4 x 1017 

4.1 x 1017 


Angular scales (arcseconds) 


Resolution of radio telescope (VLBA) 0.0002 
Accuracy attainable in measuring angular 

positions of stars 0.0015 
Hubble Space Telescope angular resolution 0.04 
Atmospheric seeing limit for optical telescope 0.6 
Gravitational deflection of a light by the Sun 1.75 


Angular diameter of typical galaxy at z = 0.5 2.9 


Angular diameter of Jupiter 

Resolution limit of eye 

Angular diameter of the Crab Nebula 
Angular diameter of the Moon/Sun 
Angular scale of deg imprinted on CMBR 
Angular diameter of Andromeda 


36 
120 
300 
1865 
3100 
14400 


Velocity (cm s—') 


Reflex speed induced on Sun by Jupiter 

Sound speed in air 

Velocity of molecules in the atmosphere 

Escape velocity from Moon 

Escape velocity from Earth 

Stellar velocity dispersion in a globular 
cluster 

Earth’s orbital speed around the Sun 

Velocity of Sun in Milky Way’s center 

Escape velocity from Sun 

Velocity dispersion of galaxies in a cluster 

Escape velocity from neutron star surface 


1.3 x 103 
3.3 x 104 
4.8 x 104 
2.4 x 10° 
1.1 x 108 


2 x 10° 
2.9 x 108 
2.2 x 107 
6.2 x 107 
8 x 107 
2 x 101° 


2.1 x 107 yr 
5.9 x 107 yr 
2.3 x 108 yr 
4.7 Gyr 

9.4 Gyr 
12.5 Gyr 

13 Gyr 


2 minutes 
5 minutes 
30 minutes 
1 degree 

4 degree 


2.4 km s7! 
11 km s7! 


20 km s~! 

29 km s7! 

220 km s7! 
620 km s71 
800 km s7! 
2x 105 km s7! 
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Energy (ergs) 


Hydrogen hyperfine transition 

Molecular rotation transition 

Thermal energy of atmosphere (kTatmos) 

Molecular vibration transition 

Hydrogen binding energy 

Electron rest mass 

Energy from 4 x (1H) — 4He fusion 

Proton rest mass 

Rotational energy of the Sun 

Gravitational internal binding energy of 
Jupiter 

Photon energy from a supernova 

Gravitational binding energy of a galaxy 

Gravitational binding energy of a cluster 


Power (watts) 


Human being 
Luminosity of a white dwarf 


Gravitational wave radiation from PSR 1913416 


Solar luminosity 
Crab Nebula energy output 


Eddington luminosity of a 1.4 Mo neutron star 


Peak photon luminosity of supernova 
Milky Way luminosity 

X-ray luminosity of gas in coma cluster 
Quasar luminosity 


9.5 x 10718 
4 x 10715 
4 x 10714 
3 x 1978 
2.2 x 107! 
8.18 x 1077 
4.3 x 1075 
1.50 x 1073 
2.5 x 1042 


3 x 1048 
3 x 105! 
2 x 108° 
5 x 1084 


150 


5.8 x 1078 eV 
2.4 x 1073 eV 
2.4 x 107? eV 


0.18 eV 

13.6 eV 

0.5 MeV 
26 MeV 

1 GeV 


1 x 10? 


8 x 1024 


3.9 x 1076 
1 x 103! 


2x 1031 


5 x 10% 
3 x 1076 
1.5 x 1038 
1 x 1039 


Density (gm cm~*) 


Density of the starlight released in a Hubble time 
Density of the cosmic microwave background radiation 
Baryon density predicted by primordial nucleosynthesis 


Critical density of the universe (3H?/87G) 
Gas density in a cluster of galaxies 

Gas density in the interstellar medium 
Density of air 

Mean density of Jupiter 


1x 107% 
4.6 x 10734 
2x 1073! 
4.7 x 10730 
2 x 10727 
3 x 10724 
1.3 x 1073 
1.3 
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Mean density of the Sun 1.4 
Mean density of the Moon 3.3 
Mean density of the Earth 5.5 
Mean density of a white dwarf 5 x 104 
Nuclear density 6 x 104 
Central density of a neutron star 1 x 1035 
Planck density(c°/G?h) 5 x 10% 


Temperature (Kelvin) 


CMBR quadrupole anisotropy 1.3 x 1075 
CMBR dipole anisotropy 3.3 x 1073 
CMBR temperature 2.726 
Surface temperature of Venus 740 
Melting point of iron 1811 

Solar surface temperature 5770 
Center of the Sun 1.4 x 107 
Gas temperature in a cluster of galaxies 5 x 107 


Characteristic temperature forete~ pair production 4x 108 


Pressure (dynes cm~?) 


Gas pressure in a cluster of galaxies 7 x 1071? 
Pressure in the best vacuum achieved on Earth 5 x 1071 
Solar radiation pressure at the Earth 5x 1075 
Atmospheric pressure 1 x 108 
Central pressure of Earth 5 x 10)? 
Central pressure of Jupiter 4 x 109}8 
Central pressure of the Sun 2.7 x 101” 
Central pressure of a white dwarf 1 x 1024 
Central pressure of a neutron star 1 x 10% 


Magnetic field (gauss) 


Magnetic field in the local interstellar medium 5 x 1076 
Magnetic field in a radio lobe of a radio galaxy 1x 1075 
Magnetic field at Earth’s surface 3 x 107} 
Magnetic field near Sun’s pole 1 


Magnetic field felt by the electron in an n = 1 hydrogen atom 2x 104 
Pulsar dipole magnetic field 1 x 10!” 
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21 cm radiation, 78 
order of magnitude, 81 


accretion, 202 
acoustic peaks, 257 
active galactic nuclei, 317 
black hole paradigm, 318 
estimates of, 323 
general properties of, 317 
radio jets, 325 
scaling, 320 
spectral energy distribution, 318 
adiabatic processes, 98 
Alfven radius, 111 
angular diameter distance, 222 
atmospheric turbulence, 119 
atomic energy levels 
order of magnitude, 75 


baryons 
constraints on, 230 
DLAS, 341 
lyman alpha forest, 336 
quasar absorption systems, 336 

beta decay, 172 

binary stars, 202 

black hole, 197 
effective potential, 200 
in galactic center, 285 
mass of the spheroid, 287 
radius of influence, 285 
temperature, 198 


bolometric magnitude, 141 
Boltzmann equation 

isothermal sphere, 105, 297 

Kalnajs disk, 299 

King model, 297 

Mestel’s disk, 298 

polytropes, 296 

solutions of, 295 
Boltzmann statistics, 51 
Bose-Einstein statistics, 51 
bound bound cross section 

order of magnitude, 82 
_bremsstrahlung, 60 

order of magnitude, 60 
brightness temperature, 50, 51 
brown dwarfs, 166 


Chandrasekhar limit, 196 
classical electron radius, 82 
classification of stars, 144 
clusters, 310, 311 
abundance, 242 
bremsstrahlung, 61 
cooling flow, 61 
dark matter, 21 
CNO cycle, 150 
colour index, 143 
Compton-y parameter, 73 
convection, 148 
cooling curve, 86 
cooling of baryons, 262 
correlation function, 310 
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Cosmic Microwave Background 
Radiation, 52, 69, 215, 231, 251 
causes of anisotropy, 251 
intrinsic fluctuations, 253 
radio jets, 325 
Sachs- Wolfe effect, 257 

Cosmic Microwave Background 
Radiation anisotropies, 252 
large angular scales, 252 
small angular scales, 254 

cosmological constant, 216 

Crab nebula, 66 

Crab pulsar, 196 

critical density 
definition of, 214 

crystallization, 189 


Damped Lyman Alpha Systems, 339 
estimates, 340 
quasar absorption systems, 339 
dark energy, 216 
dark matter, 216, 289 
de Vaucouleurs law, 292 
Debye length, 126 
density cusp, 292 
density parameter, 214 
baryons, 230 
definition of, 214 
diffusion coefficient, 191 
dipole radiation, 59 
disk galaxies 
central surface brightness, 284 
surface brightness of, 283 
dispersion measure, 129 
distance measure 
angular diameter, 222 
luminosity, 221 
distribution of matter, 310 
Doppler effect 
temperature, 121 
Doppler width, 88 
dynamical friction, 27 


early universe 
nucleosynthesis, 223 
time-temperature relation, 223 


Earth-Moon system, 15 
Eddington luminosity, 58, 203, 323 
order of magnitude, 204 
Einstein ring, 37, 38 
electromagnetic waves 
Lorentz transformation, 53 
ellipticals, 282 
fundamental plane, 293 
equation of state, 95 
Fermi gas, 101 
Euler’s equations, 13 
expansion factor, 212 
explosions 
shock waves, 116 
extra-solar planets, 9 
Extragalactic Background Light, 266, 
271 
far infrared, 272 
optical, 272 


Faber-Jackson relation, 294 
Faraday rotation, 127 
Fermi energy, 98 

order of magnitude, 99 
Fermi momentum, 98 
fine structure, 77 

order of magnitude, 77 
free-free absorption, 61 
friedmann model 

different epochs, 218 
fundamental plane, 292 


galaxies 
angular momentum, 288 
catalogues, 310 
colours of, 301 
correlation function, 310 
disk size, 291 
evolution of, 302 
luminosity function, 307 
mass of, 263 
models for, 294 
morphology of, 266 
order of magnitude, 263 
spectral energy distribution of, 299 
galaxy counts, 306 


galaxy formation 
time scales of, 261 
galaxy luminosity 
evolution of, 304, 305 
galaxy luminosity function, 305 
galaxy mass 
order of magnitude, 263 
galaxy models 
axisymmetric potential, 298 
isothermal sphere, 105, 297 
King model, 297 
polytropes, 296 
general relativity, 28 
giant branch, 168 
gravitational binding energy, 136 
gravitational lensing, 34 
microlensing, 38 
point mass, 37 
gravitational potential, 1 
and clock, 30 
gravitational radiation, 205 
gravitational relaxation, 26 
growth of perturbations 
simple derivation, 234 
Gunn-Peterson effect, 272, 273 
intergalactic medium, 272 


Harrison-Zeldovich spectrum, 250 
helium abundance, 228 
Hertzsprung-Russel diagram, 141 
order of magnitude, 139 
theoretical slope, 154 
hierarchical clustering, 239 
High Mass X-ray Binaries, 203 
hodograph, 4 
homology relations, 153 
Hubble classification, 283 
Hubble constant, 214 
definition of, 214 
Hubble deep fields, 306 
Hubble radius, 220 
definition of, 220 
Friedmann universe, 244 
hyperfine structure 
order of magnitude, 77 
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ideal fluid 
with gravity, 101 
ideal gas, 97 
properties of, 98 
inflation, 244 
early universe, 246 
slow roll-over, 247 
initial mass function, 165 
intensity, 47 
intergalactic medium, 265, 272, 334 
energy balance, 277 
ionisation of, 274 
photoionisation equilibrium, 275 
interstellar medium, 87, 184 
inverse Compton scattering, 67, 69 
inverse-square law, 1 
ionisation 
hydrogen, 122 
isothermal sphere, 104 
properties of, 105 


Jeans mass, 25, 163 


Kelvin-Helmholtz time scale, 159 
Kepler problem 

binary system, 41 
Kepler’s second law, 4 
King model, 297 
Kolmogorov spectrum, 119 
Kramer’s opacity, 62 


Lagrange points, 10 
Lane-Emden, 102 
polytropes, 104 
Larmor radius, 64, 187 
Larmor’s formula, 46 
latus rectum, 6 
line width, 82 
Low Mass X-ray Binaries, 204 
luminosity, 140 
luminosity distance, 221 
definition of, 221 
luminosity function 
galaxies, 305 
quasar, 329 
lyman alpha absorption systems, 335 
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lyman alpha forest, 335, 336 


magnitude, 141 
main sequence, 168 
mass of 
planet, 136 
star, 138 
mass-to-light ratio, 21 
mean-free-path 
photons, 72 
metal production, 266 
microlensing, 38 
molecular binding energy 
order of magnitude, 78 
molecular energy levels, 78 
molecular weight 
definition of, 98 
monatomic gas, 24 


natural width, 82 
negative hydrogen ion, 87 
neutrino oscillations, 160 
neutron decay, 227 
neutron star, 173, 194 
nucleosynthesis, 222 


opacity, 55, 68, 84, 149 

optical depth, 55, 57 

origin of perturbations 
inflation, 244 


p-p reactions, 150 
pair production, 323 
Pauli exclusion principle, 98 
period-luminosity relation, 156 
photo-ionisation, 83 

order of magnitude, 84 
photon opacity 

order of magnitude, 56 
photon-to-baryon ratio, 223 
planetary nebula, 170 
planets 

tidal friction, 15 
plasma, dispersion, 127 
plasma frequency, 126 
Plummer model, 296 


Poisson’s equation, 2 
polarization 

synchrotron radiation, 67 
polytropes 

self gravitating systems, 102 
polytropic equation of state, 102 
Pop I, 153 
Pop II, 153 
precession, 8 

free, 12 

nutation, 15 

of equinoxes, 12 
Press-Schechter formula, 241 
pressure 

general expression, 96 
principle of equivalence, 2, 30, 31 
principle of superposition, 1 
protogalaxies, 272 

intergalactic medium, 279 

ionisation, 279 
pulsar, 194 

gravitational radiation, 205 

magnetosphere, 195 


quasar absorption system 
classification of, 335 
estimates of, 336 
voigt profile, 335 
quasar abundance, 330, 333 
quasar luminosity function, 329 
quasars, 275 
absorption systems, 335 
abundance of, 333, 334 
black hole paradigm, 332 
Gunn-Peterson effect, 273 
K correlation, 343 
luminosity function, 343 
mass of, 333 
scaling, 332 


radiation 
beaming, 55 
bremsstrahlung, 59 
classical theory, 44 
dipole, 46 
plane wave, 52 


quantum theory, 46, 74 

synchrotron, 63 

thermal, 49 
radio jets, 325 

brightness temperature, 327 

bulk motion, 326 

super luminal motion, 326 
Rayleigh-Jeans regime, 51 
recombination 

optical depth, 233 
recombination rate, 83 

order of magnitude, 84, 140 
Red giant, 170 
redshift, 215 

gravitational, 32 

of recombination, 231 
relativistic beaming, 55 

luminosity function, 328 
relaxation time-scale, 25 
Reynolds number, 117, 119 
Roche radius, 10, 19, 204 
rotation measure, 130 
Runge-Lenz vector, 7 


Sachs- Wolfe effect, 254 
Saha’s equation, 121, 231 
scale invariant perturbations, 250 
Schecter luminosity function, 306 
Schwarzschild metric, 33 
effective potential, 200 
Schwarzschild radius, 3, 34 
Sedov solution, 116 
shape of Earth, 12 
shock waves, 114 
jump conditions, 114 
properties, 114 
Sedov solution, 116 
solar neutrino problem, 160 
solar wind, 161 
specific heat, 22 
spectral energy distribution, 299 
spherical accretion, 106, 107 
spherical top hat model, 235 
explicit results, 237 
spontaneous emission, 47 
standard solar model, 157 
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star formation, 266, 268, 302 
mass scales, 163 
star formation rate 
extragalactic background light, 271 
Ha luminosity, 268 
timescales, 269 
UV luminosity, 268 
stellar atmosphere, 155 
stellar mass 
order of magnitude, 138 
stellar oscillations, 156 
stellar remnants, 188 
stellar spectra, 145 
stellar structure, 147 
convection, 149 
energy generation, 151 
scaling solutions, 151 
stimulated emission, 47 
Stromgen spheres, 276 
strong explosion, 116 
structure formation 
scaling laws, 239 
sub-sonic flow, 109 
sun 
internal structure, 157 
Sunyaev-Zeldovich effect, 73 
super luminar motion, 325 
superclusters, 310 
supernova, 172, 274 
supernova remnants, 179 
different phases, 179 
sedov phase, 180 
supersonic flow, 106, 109 
spherical accretion, 107 
surface of last scattering 
redshift, 232 
thickness, 232 
synchrotron radiation, 63, 323 
cooling, 63 
order of magnitude, 63 
polarization, 67 
spectrum, 64 


thermal radiation, 51 
thin accretion disk, 111 
Thomson opacity 
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order of magnitude, 68 
Thomson scattering, 68 

cross section, 68 
tides, 17 
transition rate 

order of magnitude, 47 
transport coefficients, 190 
triple-alpha reaction, 229 
Trojan asteroids, 11 
Tully-Fisher relation, 287, 288 
turbulence, 117 

Kolmogorov spectrum, 119 


vacuum energy, 217 
variable stars, 156 
velocity space, 3 


virial theorem, 19 
examples, 24 
viscosity 
energy dissipation, 118 
Voigt function, 88 


white dwarf, 170, 188 
cooling, 192 
luminosity, 192 
order of magnitude, 170, 189 
surface structure, 190 
Wien’s regime, 51 


x-ray binaries, 203 


Zero Age Main Sequence, 150 


